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(ii) TITLE OF THE INVENTION: A METHOD FOR INHIBITING IMMUNOGLOBULIN- 



TOXICITY RESULTING FROM THE USE OF IMMUNOGLOBULINS 
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(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 
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(B) STREET: P.O. Box 4000 
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(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: CD-ROM 
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(C) OPERATING SYSTEM: WINDOWS, 

-(D)- SOFTWARE: Patentln - - • - - - — ■ 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/905,293 

(B) FILING DATE: 01-AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/023,033 

(B) FILING DATE: 02-AUG-1996 



(v.iii) ATTORNEY/ AGENT -INFORMATION: 

(A) NAME: Carey, Brian 

(B) REGISTRATION NUMBER: 44,590 

(C) REFERENCE/DOCKET NUMBER: ON0146A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 609-252-3883 
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( C ) TELEX : 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 



INDUCED 



(B) TYPE: nucleic acid 



BEST 




(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TGGCACCGAA AGCTTTCTGG GGCAGGCCAG GCCTGA 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
TCCGGACATG TTGGTACCCA CGTGGTGGTC GACGCTGAGC CTGGCTTCGA GCAGACA 

(2) INFORMATION FOR SEQ 'ID NO': 3 : 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 55 base pairs 

(B) TYPE:- nucleic acid . - ■ - . . 

"(C) STRANDEDNESS: single — -- . - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA ... 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GTCGACCACC ACGTGGGTAC CAACATGTCC GGAGCCACAT GGACAGAGGC CGGCT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTGGTTCTTG TTCATCTCCT CTCTAGATGG 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ACCATGGTCG ACCTCAGACC TGCCAAGAGC CATATC 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CATGGTCACG TGGTGTGTCC CTGGATGCAG GCTACTCTAG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) -. -STRANDEDNESS: single 
- "(D) -TOPOLOGY: -linear — - ; 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : ' 
CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG GCTCAGCGCT GACCTCAGA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GGAAAGAACC ATCACAGTCT CGCAGGGGCC CAGGGCAGCG CTGGGTGCTT 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GACGGATCGG GAGATCTGCT AGGTGACCTG AGGCGCGCCG GCTTCGAATA GCCAGAGTAA 60 

CCTTTTTTTT TAATTTTATT TTATTTTATT TTTGAGATGG AGTTTGGCGC CGATCTCCCG 120 

ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGTATC 180 

TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT TAAGCTACAA 240 

CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG CGTTTTGCGC 300 

TGCTTCGCGA TGTACGGGCC AG AT AT AC GC GTTGACATTG ATTATTGACT AGTTATTAAT 360 

AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 420 

TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 480 

TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGACT 540 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC 600 

CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT 660 

GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 720 

GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 7 80 

TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 840 

AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG 900 

TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTACTGG CTTATCGAAA 960 

TTAATACGAC TCACTATAGG GAGACCCAAG CTTGGTACCA ATTTAAATTG ATATCTCCTT 1020 

AGGTCTCGAG TCTCTAGATA ACCGGTCAAT CGATTGGAAT TCTTGCGGCC GCTTGCTAGC 1080 

CACCATGGAG TTGTGGTTAA GCTTGGTCCT TCCTTGTCCT TGTTTTAAAA GGTGTCCAGT 1140 

GTGAAGTGAA TCTGGTGGAG TCTGGGGGAG GCTTAGTGCA GCCTGGAGGG TCCCTGAAAG 12 00 

TCTCCTGTGT AACCTCTGGA TTCACTTTCA GTGACTATTA CATGTATTGG GTTCGCCAGA 12 60 

CTCCAGAGAA GAGGCTGGAG TGGGTCGCAT ACATTAGTCA AGGTGGTGAT ATAACCGACT 132 0 
■ 1 ATCCAGACAC ^TGTAAAGGGT CGATTCACCA TCTCCAGAGA CAATGCCAAG AACACCCTGT ■ 13.80 

' : • ACCTGCAAAT* GAGCCGTCTG AAGTCTGAGG ACACAGCCAT GTATTACTGT -GCAAGAGGCC 1-440 

TGGACGACGG GGCCTGGTTT GCTTACTGGG GCCAAGGGAC TCTGGTCACG GTCTCTGTAG 1500 

"CTAGCACCAA GGGCCCATCG GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG 1560 

GCACAGCGGC CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG . ACGGTGTCGT 1.620 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA CAGTCCTCAG 1680 

GACTCTACTC CCTCAGCAGC GTGGTCACCG TGCCCTCCAG CAGCTTGGGC ACCCAGACCT 1740 

ACATCTGCAA CGTGAATCAC AAGCCCAGCA ACACCAAGGT GGACAAGAAA GTTGGTGAGA 1800 

GGCCAGCACA GGGAGGGAGG GTGTCTGCTG GAAGCCAGGC TCAGCGCTCC TGCCTGGACG 1860 

CATCCCGGCT ATGCAGCCCC AGTCCAGGGC AGCAAGGCAG GCCCCGTCTG CCTCTTCACC 1920 

CGGAGGCCTC TGCCCGCCCC ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCCCAG 1980 

GCTCTGGGCA GGCACAGGCT AGGTGCCCCT AACCCAGGCC CTGCACACAA AGGGGCAGGT 2040 

GCTGGGCTCA GACCTGCCAA GAGCCATATC CGGGAGGACC CTGCCCCTGA CCTAAGCCCA 2100 

CCCCAAAGGC CAAACTCTCC ACTCCCTCAG CTCGGACACC TTCTCTCCTC CCAGATTCCA 2160 

GTAACTCCCA ATCTTCTCTC TGCAGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA 2220 

CCGTGCCCAG GTAAGCCAGC CCAGGCCTCG CCCTCCAGCT CAAGGCGGGA CAGGTGCCCT 2280 

AGAGTAGCCT GCATCCAGGG ACAGGCCCCA GCCGGGTGCT GACACGTCCA CCTCCATCTC 2340 

TTCCTCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC CTCTTCCCCC CAAAACCCAA 2400 

GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC GTGGTGGTGG ACGTGAGCCA 2460 

CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA 252 0 

GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2580 

CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA ACAAAGCCCT 2640 

CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGT GGGACCCGTG GGGTGCGAGG 2700 

GCCACATGGA CAGAGGCCGG CTCGGCCCAC CCTCTGCCCT GAGAGTGACC GCTGTACCAA 2760 

CCTCTGTCCC TACAGGGCAG CCCCGAGAAC CACAGGTGTA CACCCTGCCC CCATCCCGGG 2820 

ATGAGCTGAC CAAGAACCAG GTCAGCCTGA CCTGCCTGGT CAAAGGCTTC TATCCCAGCG 2880 

ACATCGCCGT GGAGTGGGAG AGCAATGGGC AGCCGGAGAA CAACTACAAG ACCACGCCTC 2940 

CCGTGCTGGA CTCCGACGGC TCCTTCTTCC TCTACAGCAA GCTCACCGTG GACAAGAGCA 3000 

GGTGGCAGCA GGGGAACGTC TTCTCATGCT CCGTGATGCA TGAGGCTCTG CACAACCACT 3060 

ACACGCAGAA GAGCCTCTCC CTGTCTCCGG GTAAATGAGT GCGACGGCCG GCAAGCCCCC 312 0 
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GCTCCCCGGG CTCTCGCGGT CGCACGAGGA TGCTTGGCAC GTACCCCCTG TACATACTTC 3180 

CCGGGCGCCC AGCATGGAAA TAAAGCACCC AGCGCTGCCC TGGGCCCCTG CGAGACTGTG 3240 

ATGGTTCTTT CCACGGGTCA GGCCGAGTCT GAGGCCTGAG TGGCATGAGG GAGGCAGAGC 3300 

GGGTCCCACT GTCCCCACAC TGGCCCAGGC TGTGCAGGTG TGCCTGGGCC CCCTAGGGTG 33 60 

GGGCTCAGCC AGGGGCTGCC CTCGGCAGGG TGGGGGATTT GCCAGCGTGG CCCTCCCTCC 3420 

AGCAGCACCT GCCCTGGGCT GGGCCACGGG AAGCCCTAGG AGCCCCTGGG GACAGACACA 3480 

CAGCCCCTGC CTCTGTAGGA GACTGTCCTG TTCTGTGAGC GCCCCTGTCC TCCCGACCTC 3540 

CATGCCCACT CGGGGGCATG CCTAGTCCAT GTGCGTAGGG ACAGGCCCTC CCTCACCCAT 3600 

CTACCCCCAC GGCACTAACC CCTGGCTGCC CTGCCCAGCC TCGCACCCGC ATGGGGACAC 3660 

AACCGACTCC GGGGACATGC ACTCTCGGGC CCTGTGGAGG GACTGGTGCA GATGCCCACA 3720 

CACACACTCA GCCCAGACCC GTTCAACAAA CCCCGCACTG AGGTTGGCCG GCCACACGGC 37 80 

CACCACACAC ACACGTGCAC GCCTCACACA CGGAGCCTCA CCCGGGCGAA CTGCACAGCA 3 840 

CCCAGACCAG AGCAAGGTCC TCGCACACGT GAACACTCCT CGGACACAGG CCCCCACGAG 3900 

CCCCACGCGG CACCTCAAGG CCCACGAGCC TCTCGGCAGC TTCTCCACAT GCTGACCTGC 3960 

TCAGACAAAC CCAGCCCTCC TCTCACAAGG GTGCCCCTGC AGCCGCCACA CACACACAGG 402 0 

GGATCACACA CCACGTCACG TCCCTGGCCC TGGCCCACTT CCCAGTGCCG CCCTTCCCTG 4080 

CAGGACGGAT CAGCCTCGAC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC 4140 

CCCGTGCCTT CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG 42 00 

GAAATTGCAT CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG 4260 

GACAGCAAGG GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 4320 

ATGGCTTCTG AGGCGGAAAG AACCAGCTGG GGCTCTAGGG GGTATCCCCA CGCGCCCTGT 4380 

AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA . GCGTGACCGC TACACTTGCC 4440 

AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC GTTCGCCGGG 4500 

CCTCTCAAAA AAGGGAAAAA AAGCATGCAT CTCAATTAGT CAGCAACCAT .AGTCCCGCCC 4560 

CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 4620 

TGACTAATTT TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 4680 

AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGG ACAGCTCAGG 4740 ._ : 

- GCTGCGATTT CGCGCCAAAC TTGACGGCAA TCCTAGCGTG AAGGCTGGTA * GGATTTTATG - 4800 

- CGCGCTGCCA TCATGGTTGG AGOATTGAAC- TGGATCGTCG -CCGTGTGGCA AAATATGGGG — -4860* 

" ATTGGCAAGA ACGGAGACCT ACCCTGGCCT CCGCTCAGGA ACGAGT.TCAA GTACTTCCAA 4920 " 

:-AGAATGACCA CAACCTCTTC AGTGGAAGGT AAACAGAATC TGGTGAT-TAT GGGTAGGAAA . 4980 

"ACCTGGTTCT 'CCATTCCTGA' GAAGAATCGA CCTTTAAAGG ACAGAATTAA TATAGTTCTC 5040 - 

: . AGTAGAGAAC TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT 5100 

' GCCTTAAGAC TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC 5160 - 

' ■' GGAGGCAGTT CTGTTTACCA GGAAGCCATG AATCAACCAG GGCACCTTAG ACTCTTTGTG 5220 

ACAAGGATCA TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA 52 80 

TATAAACTTC TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC 5340 

AAGTATAAGT TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT 5400 

GCTCCCCTCC TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC 5460 

TCTTTGTGAA GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA 5520 

TTTAAAGCTC TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC 5580 

TAATTGTTTG TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA 5640 

ATGCCTTTAA TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG 5700 

CTACTGCTGA CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA 57 60 

AGGACTTTCC TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC 5820 

TTGCTTGCTT TGCTATTTAC ACCACAAAGG AAAAAGCTGC ACTGCTATAC AAGAAAATTA 5880 

TGGAAAAATA TTCTGTAACC TTTATAAGTA GGCATAACAG TTATAATCAT AACATACTGT 5940 

TTTTTCTTAC TCCACACAGG CATAGAGTGT CTGCTATTAA TAACTATGCT CAAAAATTGT 6000 

GTACCTTTAG CTTTTTAATT TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT 6060 

TGACTAGAGA TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC 6120 

CTCCCACACC TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG 6180 

TTTATTGCAG CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA 6240 

GCATTTTTTT CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT 6300 

GTCTGGATCG GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC 6360 

CCCAACTTGT TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC 6420 

ACAAATAAAG CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA 6480 

TCTTATCATG TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG 6540 
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CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC 6600 

ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC 6660 

TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA 6720 

CGCGCGGGGA GAGGCGGTTT - GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 6780 

CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG 6840 

TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG 6900 

GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC '6960 

GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 7020 

TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCT'C CTGTTCCGAC CCTGCCGCTT 7 080 

ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC 7140 

TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 7200 

CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA 7260 

AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 7320 

GTAGGCGGTG CTACAGAGTT . CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 7380 

GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT 7440 

TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT 7500 

ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT 7 560 

CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 7 620 

ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA 7 680 

ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA 7740 

TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC 7 800 

TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT 7860 

TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 792 0 

TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 7980 

AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT 8040 

GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 8100 

TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 8160 

GCAGTGTTAT CACTCATGGT TATGGCAGC Av. CTGCATAATT CTCTTACTGT CATGCCATCC 822 0 
■GTAAGATGGT-TTTCTGTGAe -TGGTGAGT-AG-TGAACGAAGT- CA-TTCTGAGA ATAGTGTATG ■ 8280- 

CGGCGACCGA GTTGCTCTTG "CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA 8340 

ACTTTAAAAG TGCTCATCAT. TGGAAAACG-T--TCTTCGGGGC -GAAAACTCTC ZAAGGATC.TTA 8400 

'CCGCTGTTGA:"GATCCAGTTC GATGTAACCC.ACTCGTGCAC" CCAACTGATC TTCAGCATCT- 8460 

TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAAGAGGAA GGCAAAATGC . CGCAAAAAAG 8520 

GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA' ATATTATTGA 8580* 

AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAG AAAAAT 864,0 

AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT C 8691 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8321 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GACGGATCGG GAGATCTGCT AGGTGACCTG AGGCGCGCCG GCTTCGAATA GCCAGAGTAA 60 

CCTTTTTTTT TAATTTTATT TTATTTTATT TTTGAGATGG AGTTTGGCGC CGATCTCCCG 120 

ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGTATC 180 

TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT TAAGCTACAA 240 

CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG CGTTTTGCGC 300 

TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT AGTTATTAAT 360 

AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 420 

TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 480 
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TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGACT 540 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC 600 

CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT 660 

GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 720 

GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 780 

TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 840 

AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA GGGTGGGAGG 900 

TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTACTGG CTTATCGAAA 960 

TTAATACGAC TCACTATAGG GAGACCCAAG CTTGGTACCA ATTTAAATTG ATATCTCCTT 1020 

AGGTCTCGAG TCTCTAGATA ACCGGTCAAT CGATTGGAAT TCTTGCGGCC GCTTGCTAGC 1080 

CACCATGGAG TTGTGGTTAA GCTTGGTCCT TCCTTGTCCT TGTTTTAAAA GGTGTCCAGT 1140 

GTGAAGTGAA TCTGGTGGAG TCTGGGGGAG GCTTAGTGCA GCCTGGAGGG TCCCTGAAAG 12 00 

TCTCCTGTGT AACCTCTGGA TTCACTTTCA GTGACTATTA CATGTATTGG GTTCGCCAGA 1260 

CTCCAGAGAA GAGGCTGGAG TGGGTCGCAT ACATTAGTCA AGGTGGTGAT ATAACCGACT 1320 

ATCCAGACAC TGTAAAGGGT CGATTCACCA TCTCCAGAGA CAATGCCAAG AACACCCTGT 1380 

ACCTGCAAAT GAGCCGTCTG AAGTCTGAGG ACACAGCCAT GTATTACTGT GCAAGAGGCC 1440 

TGGACGACGG GGCCTGGTTT GCTTACTGGG GCCAAGGGAC TCTGGTCACG GTCTCTGTAG 1500 

CTAGCACCAA GGGCCCATCG GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG 1560 

GCACAGCGGC CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 162 0 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA CAGTCCTCAG 1680 

GACTCTACTC CCTCAGCAGC GTGGTCACCG TGCCCTCCAG CAGCTTGGGC ACCCAGACCT 1740 

ACATCTGCAA CGTGAATCAC AAGCCCAGCA ACACCAAGGT GGACAAGAAA GTTGGTGAGA 1800 

GGCCAGCACA GGGAGGGAGG GTGTCTGCTG GAAGCCAGGC TCAGCGCTCC TGCCTGGACG 1860 

CATCCCGGCT ATGCAGCCCC AGTCCAGGGC AGCAAGGCAG GCCCCGTCTG CCTCTTCACC 1920 

CGGAGGCCTC TGCCCGCCCC ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCCCAG 1980 

GCTCTGGGCA GGCACAGGCT AGGTGCCCCT AACCCAGGCC CTGCACACAA AGGGGCAGGT 2 040 

GCTGGGCTCA .GACCTGCCAA GAGCCATATC CGGGAGGACC CTGCCCCTGA CCTAAGC.CCA .2100 

CCCCAAAGGC- CAAAGTCTGC -ACTCCCTCAG' CTCGGACACC .TTCTCTCCTC CCAGATTCCA 2160 

GTAACTCCCA-ATCTTCTCTC- TGCAGAGCGG AAATGTTGTG- ACAAAAGTCA -CACATGCCCA , 222-0- 

' CCGTGCCCAG " GTAAGCCAGC CCAGGCCTCG" CCCTCCAGCT CAAGGCGGGA CAGGTGCCCT 2280 

^AGAGTAGCCT~GGATCCAGGG .'AGACAGCACG TGGGTACCAA XATGTCCGGA GCCACATGGA . '234-0 

"CAGAGGCCGG^nTCGGCCCAC' CCTCTGCGCT " GAGAGTGACC"GCTGTACCAAT CCTCTGTCCC- '2400 

TACAGGGCAG CCCGGAGAAC CACAGGTGTA ' CACCCTGCCC CCATCCCGGG ATGAGCTGAC 2460 

CAAGAACCAG GTCAGCCTGA CCTGCCTGGT CAAAGGCTTC ' TATCCCAGCG ACATCGCCGT 2 52 0 

GGAGTGGGAG AGCAATGGGC AGCCGGAGAA CAACTACAAG ACCACGCCTC CCGTGCTGGA 2580 

CTCCGACGGC TCCTTCTTCC TCTACAGCAA GCTCACCGTG GACAAGAGCA GGTGGCAGCA 2640 

GGGGAACGTC TTCTCATGCT CCGTGATGCA TGAGGCTCTG CACAACCACT ACACGCAGAA 2700 

GAGCCTCTCC CTGTCTCCGG GTAAATGAGT GCGACGGCCG GCAAGCCCCC GCTCCCCGGG 2760 

CTCTCGCGGT CGCACGAGGA TGCTTGGCAC GTACCCCCTG TACATACTTC CCGGGCGCCC 2820 

AGCATGGAAA TAAAGCACCC AGCGCTGCCC TGGGCCCCTG CGAGACTGTG ATGGTTCTTT 2 880 

CCACGGGTCA GGCCGAGTCT GAGGCCTGAG TGGCATGAGG GAGGCAGAGC GGGTCCCACT 2940 

GTCCCCACAC TGGCCCAGGC TGTGCAGGTG TGCCTGGGCC CCCTAGGGTG GGGCTCAGCC 3000 

AGGGGCTGCC CTCGGCAGGG TGGGGGATTT GCCAGCGTGG CCCTCCCTCC AGCAGCACCT 3060 

GCCCTGGGCT GGGCCACGGG AAGCCCTAGG AGCCCCTGGG GACAGACACA CAGCCCCTGC 3120 

CTCTGTAGGA GACTGTCCTG TTCTGTGAGC GCCCCTGTCC TCCCGACCTC GATGCCCACT 3180 

CGGGGGCATG CCTAGTCCAT GTGCGTAGGG ACAGGCCCTC CCTCACCCAT CTACCCCCAC 3240 

GGCACTAACC CCTGGCTGCC CTGCCCAGCC TCGCACCCGC ATGGGGACAC AACCGACTCC 3300 

GGGGACATGC ACTCTCGGGC CCTGTGGAGG GACTGGTGCA GATGCCCACA CACACACTCA 3360 

GCCCAGACCC GTTCAACAAA CCCCGCACTG AGGTTGGCCG GCCACACGGC CACCACACAC 3420 

ACACGTGCAC GCCTCACACA CGGAGCCTCA CCCGGGCGAA CTGCACAGCA CCCAGACCAG 3480 

AGCAAGGTCC TCGCACACGT GAACACTCCT CGGACACAGG CCCCCACGAG CCCCACGCGG 3540 

CACCTCAAGG CCCACGAGCC TCTCGGCAGC TTCTCCACAT GCTGACCTGC TCAGACAAAC 3600 

CCAGCCCTCC TCTCACAAGG GTGCCCCTGC AGCCGCCACA CACACACAGG GGATCACACA 3660 

CCACGTCACG TCCCTGGCCC TGGCCCACTT CCCAGTGCCG CCCTTCCCTG CAGGACGGAT 3720 

CAGCCTCGAC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC CCCGTGCCTT 3780 

CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT 3840 

CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 3900 
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GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT ATGGCTTCTG 
AGGCGGAAAG AACCAGCTGG GGCTCTAGGG GGTATCCCCA CGCGCCCTGT AGCGGCGCAT 
TAAGCGCGGC GGGTGTGGTG GTTACGCGCA GCGTGACCGC TACACTTGCC AGCGCCCTAG 
CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC GTTCGCCGGG CCTCTCAAAA 
AAGGGAAAAA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC CTAACTCCGC 
CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC TGACTAATTT 
TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG AAGTAGTGAG 
GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGG ACAGCTCAGG GCTGCGATTT 
CGCGCCAAAC TTGACGGCAA TCCTAGCGTG AAGGCTGGTA GGATTTTATC CCCGCTGCCA 
TCATGGTTCG ACCATTGAAC TGCATCGTCG CCGTGTCCCA AAATATGGGG ATTGGCAAGA 
ACGGAGACCT ACCCTGGCCT CCGCTCAGGA ACGAGTTCAA GTACTTCCAA AGAATGACCA 
CAACCTCTTC AGTGGAAGGT AAACAGAATC TGGTGATTAT GGGTAGGAAA ACCTGGTTCT 
CCATTCCTGA GAAGAATCGA CCTTTAAAGG ACAGAATTAA TATAGTTCTC AGTAGAGAAC 
TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT GCCTTAAGAC 
TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC GGAGGC AGTT 
CTGTTTACCA GGAAGCCATG AATCAACCAG GCCACCTTAG ACTCTTTGTG ACAAGGATCA 
TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA TATAAACTTC 
TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC AAGTATAAGT 
TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT GCTCCCCTCC 
TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC TCTTTGTGAA 
GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA TTTAAAGCTC 
TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC TAATTGTTTG 
TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA ATGCCTTTAA 
TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG CTACTGCTGA 
CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA AGGACTTTCC 
TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC TTGCTTGCTT 
TGCTATT.TAC. ACCACAAAGG AAAAAGC TGC ACTGCTATAC AAGAAAATTA TGG AAAAAT A . 
vrTCTGTAACC' TTTATAAGTA GGCATAACAG TTATAATCAT • AACATACTGT TTTTTCTTAC 
TC€ACACAGG*'(~ ATAGAGTGT * GTGCTATTAA TAACTATGOT GAAAAATTGT -GTACCTT-TAG 
CTTTTTAATT "TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT TG AC TAG AG A 
-TGATAATCAG~CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC CTCCCACACC" 



"TCCCCCTGXA^CCTGAAACAT AAAATGAATG" CAATTGTTGT TGTTAACTTG:* TTTATTGCAG' 
: CT-TAT AATGG .TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 
CACTGCATTC " TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT GTCTGGATCG 
GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCCAACTTGT 
TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 
CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 
TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG 
TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 
AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 
CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 
GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT- CACTGACTCG CTGCGCTCGG 
TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 
AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 
GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 
TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 
TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC 
TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 
CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 
TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 
CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 
TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 
AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 
AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 
AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 
TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 
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ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 7380 

CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 7440 . 

GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 7500 

TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 75 60 

TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 7620 

GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 7680 

CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 7740 

AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 7 800 

CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 7 860 

TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA 7920 

GTTGCTCTTG CCCGGCGTCA AT AC GGG AT A ATACCGCGCC ACATAGCAGA ACTTTAAAAG 7980 

TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 8040 

GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 8100 

CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 8160 

CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC 8220 

" AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 82 80 

GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT C 8321 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

■-■_„. (ii) MOLECULE TYPE: cDNA _ „ . 

.'-—■>---.- ~{xi) SEQUENCE* DESCRIPTION*:- SEQ ID NO:-ll:- - - - - * - - 

■\. GGTACCAATT TAAATTGATA TCTCCTTAGG^TCTCGAGCAC - ,CSTGAAGT-TG^"CCTGlETAGGC ~~. -6'0:^. 

TGTTGGTGCT. "GATGTTCTGG ATTCCTGCTT~CCAGCAGTGA " TGTTT.TGATG ."ACCCAAATTC 120 ~ : 7 

' CAGTCTCCCT GCCTGTCAGT CTTGGAGATC AAGCGTCCAT CTCTTGCAGA TCTAGTCAGA 180.^ \ 

" ■ " "TCATTGTACA TAATAATGGC AACACCTATT TAGAATGGTA CCTGCAGAAA CCAGGCCAGT 240 ' ' • ~ * 

■ CTCCACAGCT CCTGATCTAC AAAGTTTCCA ACCGATTTTC TGGGGTCCCA GACAGGTTCA 300 1 

GCGGCAGTGG ATCAGGGACA GATTTCACAC TCAAGATCAG CAGAGTGGAG GCTGAGGATC 360 

TGGGAGTTTA TTACTGCTTT CAAGGTTCAC ATGTTCCATT CACGTTCGGC TCGGGGACAA 420 

AGTTGGAAAT AAAACGTAAG TCTCGAGTCT CTAGATAACC GGTCAATCGA TTGGAATTCT 480 

AAACTCTGAG GGGGTCGGAT GACGTGGCCA TTCTTTGCCT AAAGCATTGA GTTTACTGCA 540 

AGGTCAGAAA AGCATGCAAA GCCCTCAGAA TGGCTGCAAA GAGCTCCAAC AAAACAATTT 600 

AGAACTTTAT TAAGGAATAG GGGGAAGCTA GGAAGAAACT CAAAACATCA AGATTTTAAA 660 

TACGCTTCTT GGTCTCCTTG CTATAATTAT CTGGGATAAG CATGCTGTTT TCTGTCTGTC 720 

CCTAACATGC CCTTATCCGC AAACAACACA CCCAAGGGCA GAACTTTGTT ACTTAAACAC 7 80 

CATCCTGTTT GCTTCTTTCC TCAGGAACTG TGGCTGCACC ATCTGTCTTC ATCTTCCCGC 840 

CATCTGATGA GCAGTTGAAA TCTGGAACTG CCTCTGTTGT GTGCCTGCTG AATAACTTCT 900 

ATCCCAGAGA GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 960 

AGGAGAGTGT CACAGAGCAG GAGAGCAAGG ACAGCACCTA CAGCCTCAGC AGCACCCTGA 1020 

CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC CTGCGAAGTC ACCCATCAGG 1080 

GCCTGAGCTC GCCCGTCACA AAGAGCTTCA ACAGGGGAGA GTGTTAGAGG GAGAAGTGCC 1140 

CCCACCTGCT CCTCAGTTCC AGCCTGACCC CCTCCCATCC TTTGGCCTCT GACCCTTTTT 1200 

CCACAGGGGA CCTACCCCTA TTGCGGTCCT CCAGCTCATC TTTCACCTCA CCCCCCTCCT 1260 

CCTCCTTGGC TTTAATTATG CTAATGTTGG AGGAGAATGA ATAAATAAAG TGAATCTTTG 1320 

CACCTGTGGT TTCTCTCTTT CCTCATTTAA TAATTATTAT CTGTTGTTTT ACCAACTACT 1380 

CAATTTCTCT TATAAGGGAC TAAATATGTA GTCATCCTAA GGCACGTAAC CATTTATAAA 1440 

AATCATCCTT CATTCTATTT TACCCTATCA TCCTCTGCAA GACAGTCCTC CCTCAAACCC 1500 

ACAAGCCTTC TGTCCTCACA GTCCCCTGGG CCATGGTAGG AGAGACTTGC TTCCTTGTTT 1560 

TCCCCTCCTC AGCAAGCCCT CATAGTCCTT TTTAAGGGTG ACAGGTCTTA CAGTCATATA 162 0 
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TCCTTTGATT CAATTCCCTG AGAATCAACC AAAGCAAATT TTTCAAAAGA AGAAACCTGC 1680 

TATAAAGAGA ATCATTCATT GCAACATGAT ATAAAATAAC AACACAATAA AAGCAATTAA 1740 

ATAAACAAAC AATAGGGAAA TGTTTAAGTT CATCATGGTA CTTAGACTTA ATGGAATGTC 1800 

ATGCCTTATT TACATTTTTA AACAGGTACT GAGGGACTCC TGTCTGCCAA GGGCCGTATT 1860 

GAGTACTTTC CACAACCTAA TTTAATCCAC ACTATACTGT GAGATTAAAA ACATTCATTA 1920 

AAATGTTGCA AAGGTTCTAT AAAGCTGAGA GACAAATATA TTCTATAACT CAGCAATCCC 1980 

ACTTCTAGAT GACTGAGTGT CCCCACCCAC CAAAAAACTA TGCAAGAATG TTCAAAGCAG 2040 

CTTTATTTAC AAAAGCCAAA AATTGGAAAT AGCCCGATTG TCCAACAATA GAATGAGTTA 2100 

TTAAACTGTG GTATGTTTAT ACATTAGAAT ACCCAATGAG GAGAATTAAC AAGCTACAAC 2160 

TATACCTACT CACACAGATG AATCTCATAA AAATAATGTT ACATAAGAGA AACTCAATGC 2220 

AAAAGATATG TTCTGTATGT TTTCATCCAT ATAAAGTTCA AAACCAGGTA AAAATAAAGT 2280 

TAGAAATTTG GATGGAAATT ACTCTTAGCT GGGGGTGGGC GAGTTAGTGC CTGGGAGAAG 2340 

ACAAGAAGGG GCTTCTGGGG TCTTGGTAAT GTTCTGTTCC TCGTGTGGGG TTGTGCAGTT 2400 

ATGATCTGTG CACTGTTCTG TATACACATT ATGCTTCAAA ATAACTTCAC ATAAAGAACA 2460 

TCTTATACCC AGTTAATAGA TAGAAGAGGA ATAAGTAATA GGTCAAGACC AACGCAGCTG 2520 

GTAAGTGGGG GCCTGGGATC AAATAGCTAC CTGCCTAATC CTGCCCWCTT GAGCCCTGAA 2580 

TGAGTCTGCC TTCCAGGGCT CAAGGTGCTC AACAAAACAA CAGGCCTGCT ATTTTCCTGG 2640 

CATCTGTGCC CTGTTTGGCT AGCTAGGAGC ACACATACAT AGAAATTAAA TGAAACAGAC 27 00 

CTTCAGCAAG GGGACAGAGG ACAGAATTAA CCTTGCCCAG ACACTGGAAA CCCATGTATG 27 60 

AACACTCACA TGTTTGGGAA GGGGGAAGGG CACATGTAAA TGAGGACTCT TCCTCATTCT 2820 

ATGGGGCACT CTGGCCCTGC CCCTCTCAGC TACTCATCCA TCCAACACAC CTTTCTAAGT 2 880 

ACCTCTCTCT GCCTACACTC TGAAGGGGTT CAGGAGTAAC TAACACAGCA TCCCTTCCCT 2940 

CAAATGACTG ACAATCCCTT TGTCCTGCTT TGTTTTTCTT TCCAGTCAGT ACTGGGAAAG 3 000 

TGGGGAAGGA CAGTCATGGA GAAACTACAT AAGGAAGCAC CTTGCCCTTC TGCCTCTTGA 3060 

GAATGTTGAT GAGTATCAAA TCTTTCAAAC TTTGGAGGTT TGAGTAGGGG TGAGACTCAG 312 0 

TAATGTCCCT TCCAATGACA TGAACTTGCT CACTCATCCC TGGGGGCCAA ATTGAACAAT 3180 

CAAAGGCAGG CATAATCCAG TTATGAATTC TTGCGGCCGC TTGCTAGCT.T- CACGTGTTGG 3240 

ATCCAACGGC GGAAGGGCCC TATTCTATAG TGTCACCTAA ATGCTAGAGC TCGCTGATCA 3300 

GCCTCGACTG TGCCTTCTAG TTGCCAGCCA- : TCTGTTGTTT • GCGCGTCGCC CGTGCCTTCC ■ 3360 

TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG 3420 

CATTGTCTGA. GTAGGTGTCA TTG^AT-TCTG-GjGGGGTGGGG^GGGGCAGGA^CAGCAAGGGG 3480 
'.GAGGATTGGG"AAGA'CAATAG .CAGGCATJGCT" GGGGATGCGG~TGGGCTCTAT~GGCTTCTGAG- ■■-3'5"4"Q" 

GCGGAAAGAA CCAGCTGGGG CTCTAGGGGG* - TATGCCCACG . CGCCCTGTAG. CGGCGCATTA 3600 

AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGGTA "CACTTGCCAG CGCCCTAGCG 3660 

CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT TCGCCGGGCC TCTCAAAAAA 3720 

GGGAAAAAAA GCATGCATCT CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC 37 80 

ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT 3840 

TTTATTTATG CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA 3900 

GGCTTTTTTG GAGGCCTAGG CTTTTGCAAA AAGCTTGGAC AGCTCAGGGC TGCGATTTCG 3960 

CGCCAAACTT GACGGCAATC CTAGCGTGAA GGCTGGTAGG ATTTTATCCC CGCTGCCATC 4020 

ATGGTTCGAC CATTGAACTG CATCGTCGCC GTGTCCCAAA ATATGGGGAT TGGCAAGAAC 4080 

GGAGACCTAC CCTGGCCTCC GCTCAGGAAC GAGTTCAAGT ACTTCCAAAG AATGACCACA 4140 

ACCTCTTCAG TGGAAGGTAA ACAGAATCTG GTGATTATGG GTAGGAAAAC CTGGTTCTCC 4200 

ATTCCTGAGA AGAATCGACC TTTAAAGGAC AGAATTAATA TAGTTCTCAG TAGAGAACTC 4260 

AAAGAACCAC CACGAGGAGC TCATTTTCTT GCCAAAAGTT TGGATGATGC CTTAAGACTT 4320 

ATTGAACAAC CGGAATTGGC AAGTAAAGTA GACATGGTTT GGATAGTCGG AGGCAGTTCT 4380 

GTTTACCAGG AAGCCATGAA TCAACCAGGC CACCTTAGAC TCTTTGTGAC AAGGATCATG 4440 

CAGGAATTTG AAAGTGACAC GTTTTTCCCA GAAATTGATT TGGGGAAATA TAAACTTCTC 4500 

CCAGAATACC CAGGCGTCCT CTCTGAGGTC CAGGAGGAAA AAGGCATCAA GTATAAGTTT 4560 

GAAGTCTACG AGAAGAAAGA CTAACAGGAA GATGCTTTCA AGTTCTCTGC TCCCCTCCTA 4620 

AAGCTATGCA TTTTTATAAG ACCATGGGAC TTTTGCTGGC TTTAGATCTC TTTGTGAAGG 4680 

AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA 4740 

AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG 4800 

TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG 4860 

AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATC TAG TGA TGATGAGGCT ACTGCTGACT 4920 

CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG GACTTTCCTT 4980 

CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 5040 



10 



CTATTTACAC CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT 5100 

CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC 5160 

CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT 5220 

TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC 52 80 

ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 5340 

CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT 5400. 

TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA 5460 

CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCGGC 5520 

TGGATGATCC ' TCCAGCGCGG GGATCTCATG CTGGAGTTCT TCGCCCACCC CAACTTGTTT 5580' 

ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGC ATC A CAAATTTC AC AAATAAAGCA 5640 

TTTTTTTCAC TGCATTCTAG TTGTGGTTTG TCCAAACTCA TCAATGTATC TTATCATGTC 57 00 

TGTATACCGT CGACCTCTAG CTAGAGCTTG GCGTAATCAT GGTCATAGCT GTTTCCTGTG 57 60 

TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT AAAGTGTAAA 582 0 

GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC ACTGCCCGCT 5880 

TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG CGCGGGGAGA 5940 

GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT GCGCTCGGTC 6000 

GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT ATCCACAGAA 6060 

TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT 612 0 

AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 6180 

AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT 6240 

CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 6300 

TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG TAGGTATCTC 6360 

AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC 6420 

GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 6480 

TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT 6540 

ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 6600 

TGCGCTCTGC. TGAAGCCAGT. TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA 6660 

• CAAACCACCG, GTGGTAGCGG TGGTTTTTTT GTT.TGC AAGC . ' AGCAGATTAC . GCGCAGAAAA 672 0 

AAAGGATGTG AAGAAGATCC ■ TTTGATGT-T-T— TGTAGGGGGT CTGACGCTCA GTGGAAGGAA • - 

AACTCACGTT" AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT 6840" 

TTAAAT 'i ' AAA- -AATGAAGffTO ^AAA'TCAATC- TAAAGTATAT ATGAGTAAAC TTGGTCTGAC . "r6-9"00 ~ " " 

AGTTACCAA'J^GCTOAATCAXr^SGGCACCT^ATCTCAGCGff "TCTGTCTATT" TCGTTCATCC" — £960— 

ATAGTTGCCT" GACTCCCCGT. CGTGTAGATA 'ACTACGATAC. GGGAGGGCTT ACCATCTGGC 7020 

CCCAGTGCTG "CAATGATACC 'GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA 7 080 

AACCAGCCAG CCGGAAGGGC- CGAGCGCAGA -AGTGGTCC.TG CAACTTTATC CGCCTCCATC 7140 

CAGTCTATTA ATTGTTGCCG "GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 72 00 

AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA 72 60 

TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 7320 

GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 7380 

CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 7440 

TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT 7500 

TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG 7560 

CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA 7620 

TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 7 680 

AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 7740 

ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG 7800 

GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 7860 

GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCG ACGGATCGGG AGATCTGCTA 7920 

GCCCGGGTGA CCTGAGGCGC GCCGGCTTCG AATAGCCAGA GTAACCTTTT TTTTTAATTT 7980 

TATTTTATTT TATTTTTGAG ATGGAGTTTG GCGCCGATCT CCCGATCCCC TATGGTCGAC 8040 

TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT 8100 

GTTGGAGGTC GCTGAGTAGT GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC 8160 

CGACAATTGC ATGAAGAATC TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG 8220 

GGCCAGATAT ACGCGTTGAC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG 8280 

GTCATTAGTT CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC 8340 

GCCTGGCTGA CCGCCCAACG ACCCCCGCCC - ATTGACGTCA AT AATG AC GT ATGTTCCCAT 8400 

AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC 8460 
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CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA 8520 

CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG 8580 

GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT 8640 

CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT 8700 

CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC 8760 

CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC 8820 

TCTCTGGCTA ACTAGAGAAC CCACTGCTTA CTGGCTTATC GAAATTAATA CGACTCACTA 8880 

TAGGGAGACC CAAGCTT 8897 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGTCT CTAGATAACC GGTCAATCGA 60 
TTGGAATTCT TGCGGCCGCT TGCTAGCCAC CATGGAGTTG TGGTTAAGCT TGGTCTTCCT 120 
TGTCCTTGTT TTAAAAGGTG TCCAGTGTGA AGTGCAACTG GTGGAGTCTG GGGGAGGCTT 180 
AGTGCAGCCT GGAGGGTCCC TGCGACTTTC CTGTGCTGCA TCTGGATTCC CGTTCAGTGA 240 
CTATTACATG TATTGGGTTC GCCAGGCTCC AGGCAAGGGA CTGGAGTGGG TCTCATACAT 300 
. - TAGTC AAGAT GGTGATATAA CCGACTATGC AGACTCCGTA AAGGGTCGAT TCACCATCTC ..3.60. 
CAGAGACAAT GCAAAGAACA 'GCCTGTACCT . GC AAATGAAC AGCCTGAGGG ACGAGGACAC - -.u 420- 
* * — • ■ • AGCCGTGTAT - -T ACTGTGGAAr G AGGCCTGGC GGAGGGGGGG- -TGGTTTGG-TT AGTGGGGCGA • i • '480 
- " — -:- " "AGGGACTCTG • GTCACGGTCT CTTCCGCTAG CACCAAGGGC CCATCGGTCT TCCCCCTGGC 540 
A CC£TGCTCC/ AAGAGCACCT C-TGGGGGCAC AGCGGCCCTG GGCTGGCTGG-TCAAGGACTA — ~~6:0tC 



T:TTCCT^GAA~:CCGGTGACGG"TGTCGTG 

CTTCCCGGCT GTCCTACAGT CCTCAGGACT* ;CTACTCCCTC AGCAGCGTGG TCACCGTGCC J 2 0 

CTCCAGCAGC -TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC 7 80* 

CAAGGTGGAC -AAGAAAGTTG GTGAGAGGCC AGCACAGGGA GGGAGGGTGT CTGCTGGAAG • 840 

CCAGGCTCAG CGCTCCTGCC TGGACGCATC CCGGCTATGC AGCCCCAGTC CAGGGCAGCA 900 

AGGCAGGCCC CGTCTGCCTC TTCACCCGGA GGCCTCTGCC CGCCCCACTC ATGCTCAGGG 960 

AGAGGGTCTT CTGGCTTTTT CCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC 102 0 

CAGGCCCTGC ACACAAAGGG GC AGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG 1080 

AGGACCCTGC CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG 1140 

GACACCTTCT CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT 1200 

CTTGTGACAA AACTCACACA TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT 1260 

CCAGCTCAAG GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAC ACCACGTGGG 1320 

TACCAACATG TCCGGAGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC TGCCCTGAGA 1380 

GTGACCGCTG TACCAACCTC TGTCCCTACA GGGCAGCCCC GAGAACCACA GGTGTACACC 1440 

CTGCCCCCAT CCCGGGATGA GCTGACCAAG AACCAGGTCA GCCTGACCTG CCTGGTCAAA 1500 

GGCTTCTATC CCAGCGACAT CGCCGTGGAG TGGGAGAGCA ATGGGCAGCC GGAGAACAAC 1560 

TACAAGACCA CGCCTCCCGT GCTGGACTCC GACGGCTCCT TCTTCCTCTA CAGCAAGCTC 162 0 

ACCGTGGACA AGAGCAGGTG GCAGCAGGGG AACGTCTTCT CATGCTCCGT GATGCATGAG 1680 

GCTCTGCACA ACCACTACAC GCAGAAGAGC CTCTCCCTGT CTCCGGGTAA ATGAGTGCGA 1740 

CGGCCGGCAA GCCCCCGCTC CCCGGGCTCT CGCGGTCGCA CGAGGATGCT TGGCACGTAC 1800 
CCCCTGTACA TACTTCCCGG GCGCCCAGCA TGGAAATAAA GCACCCAGCG CTGCCCTGGG , 1860 

CCCCTGCGAG ACTGTGATGG TTCTTTCCAC GGGTCAGGCC GAGTCTGAGG CCTGAGTGGC 1920 

ATGAGGGAGG CAGAGCGGGT CCCACTGTCC CCACACTGGC CCAGGCTGTG CAGGTGTGCC 1980 

TGGGCCCCCT AGGGTGGGGC TCAGCCAGGG GCTGCCCTCG GCAGGGTGGG GGATTTGCCA 2040 

GCGTGGCCCT CCCTCCAGCA GCACCTGCCC TGGGCTGGGC CACGGGAAGC CCTAGGAGCC 2100 

CCTGGGGACA GACACACAGC CCCTGCCTCT GTAGGAGACT GTCCTGTTCT GTGAGCGCCC 2160 
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CTGTCCTCCC GACCTCCATG CCCACTCGGG GGCATGCCTA GTCCATGTGC GTAGGGACAG 2220 

GCCCTCCCTC ACCCATCTAC CCCCACGGCA CTAACCCCTG GCTGCCCTGC CCAGCCTCGC 2280 

ACCCGCATGG GGACACAACC GACTCCGGGG ACATGCACTC TCGGGCCCTG TGGAGGGACT 2340 

GGTGCAGATG CCCACACACA CACTCAGCCC AGACCCGTTC AACAAACCCC GCACTGAGGT 2 400 

TGGCCGGCCA CACGGCCACC ACACACACAC GTGCACGCCT CACACACGGA GCCTCACCCG 2460 

GGCGAACTGC ACAGCACCCA GACCAGAGCA AGGTCCTCGC ACACGTGAAC ACTCCTCGGA 2520 

CACAGGCCCC CACGAGCCCC ACGCGGCACC TCAAGGCCCA CGAGCCTCTC GGCAGCTTCT 2580 

CCACATGCTG ACCTGCTCAG ACAAACCCAG CCCTCCTCTC ACAAGGGTGC CCCTGCAGCC 2 640 

GCCACACACA CACAGGGGAT CACACACCAC GTCACGTCCC TGGCCCTGGC CCACTTCCCA 27 00 

GTGCCGCCCT TCCCTGCAGG ACGGATCAGC CTCGACTGTG CCTTCTAGTT GCCAGCCATC 27 60 

TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 2820 

TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT CTATTCTGGG 2880 

GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG 2940 

GGATGCGGTG GGC TCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT CTAGGGGGTA 3000 

TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 3060 

GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CT.TCCTTTCT 3120 

CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA ATTAGTCAGC 3180 

AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA GTTCCGCCCA 3240 

TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG CCGCCTCGGC 3300 

CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT TTTGCAAAAA 33 60 

GCTTGGACAG CTCAGGGCTG CGATTTCGCG CCAAACT.TGA CGGCAATCCT AGCGTGAAGG 342 0 

CTGGTAGGAT TTTATCCCCG CTGCCATCAT GGTTCGACCA TTGAACTGCA TCGTCGCCGT 3480 

GTCCCAAAAT ATGGGGATTG GCAAGAACGG AGACCTACCC TGGCCTCCGC TCAGGAACGA 3540 

GTTCAAGTAC TTCCAAAGAA TGACCACAAC CTCTTCAGTG GAAGGTAAAC AGAATCTGGT 3600 

GATTATGGGT AGGAAAACCT GGTTCTCCAT TCCTGAGAAG AATCGACCTT TAAAGGACAG 3660 

AATTAATATA GTTCTCAGTA GAGAACTCAA AGAACCACCA CGAGGAGCTC ATTTTCTTGC 372 0 

. ' .--r. CAAAAGTTTG GATGATGCCT TAAGACTTAT TGAACAACCG GAATTGGCAA GTAAAG.TAGA _.378.0_. _■ 

- . CATGGTTTGG ATAGTCGGAG GCAGTTCTGT TTACCAGGAA. GCCATGAATC AACCAGGCCA. 3840 . . 

j~ .----eoTTAGACTC TTTGTGAGAA -GGATGATGCA-GGAATTTGAA AGTGAGAGGT TTTTCCCAGA- 390.0-- 

~.v - AATTGATTTG GGGAAATATA AACTTCTCCC AGAATACCCA GGCGTCCTCT CTGAGGTCCA 3960 

~ .-TK GGAGGAAAAA GGCATCAAGT ATAAGTTTCA' AGTCTACGAG^MAKAGaC3XrAACAGGAAGA. 4*.a2Q , * 

~ _ TGCTTTCAAG~"TTeTCTGCTC^X^ 4-O80 

. * : T.TGCTGGCTT . TAGATCTCTT TGTGAAGGAA CCTTACTTCT GTGGTGT.GAG ATAATTGGAC .4140 ' ....... 

• ' • AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG ' JTGTATAATGT ^-4200 * . — 

GTTAAACTAC TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG AACTGATGAA' 4260 

TGGGAGCAGT GGTGGAATGC C.TTTAATGAG GAAAACCTGT TTTGCTCAGA AGAAATGCCA 4320 

TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA AAAGAAGAGA 43 80 

AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG TCATGCTGTG 4440 

TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA AGCTGCACTG 4500 

CTATACAAGA AAATTATGGA. AAAATATTCT GTAACCTTTA TAAGTAGGCA TAACAGTTAT 4560 

AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC TATTAATAAC 4620 

TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA TAAGGAATAT 4680 

TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT 4740 

ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT 4800 

TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC 4860 

AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT 4920 

CAATGTATCT TATCATGTCT GGATCGGCTG GATGATCCTC CAGCGCGGGG ATCTCATGCT 4980 

GGAGTTCTTC GCCCACCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA 5040 

TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC 5100 

CAAACTCATC AATGTATCTT ATCATGTCTG TATACCGTCG ACCTCTAGCT AGAGCTTGGC 5160 

GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA 5220 

CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC 5280 

ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA 5340 

TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC 5400 

CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC 5460 

AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC 5520 

AAAAGGCCAG' CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG 5580 
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GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 5640 

GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT 5700 

TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT 5760 

TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG 5 820 

CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 5880 

TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 5940 

TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG 6000 

CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA 6060 

AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT 612 0 

TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC 6180 

TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT 6240 

ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA 6300 

AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT 6360 

CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC 6420 

TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG 6480 

CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG 6540 

TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT 6600 

AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT 6660 

GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT 6720 

TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT 6780 

CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT 6840 

TACTGTCATG CCATCCGTAA GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT 6900 

CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC 6960 

CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA 7 020 

ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA 7 080 

CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA 7140 

AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTOA^ TACTCTTCCT 7200 

-TTTTGAATAT. TATTGAAGC A- TTTATC AGGG ^TTATTGTCTC-i ATGAGCGGAT - ACATAT.TTGA . 7260. . 

* ATGTATTTAG ~ AAAAAT AAAO - -AAAT AGGGGT - TGCGCGC AC A TT-TCGCCGAA AAGTGGCACC ■ 7320 

TGACGTCGAC" GGATCGGGAG " ATCTGCTAGG "TGACCTGAGG .CGCGCCGGCT TCGAATAGCC 7380 

AGAGTAAGC.T--^FTTTTTTAA TTTTATTT-TA-4^T^ TTGGGGCCGA 7440 

""TCTCCCGATC CCCTATGGTC GAC TCTCAGT ^ AC AATCTGCT; "CTGATGCCGC" ATAGTTAAGC 7500 

CAGTATC.TGC TCCCTGCTTG TGTGTTGGAG_GTCGCTGAGT r AGTGCGCGAG : CAAAATTTAA .7560 

GCTACAACAA GGGAAGGCTT GACCGACAAT TGCATGAAGA; ATCTGCTTAG GGTTAGGCGT 7620 

TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT^ GACATTGATT ATTGACTAGT 7680 

TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 7740 

ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG CCCATTGACG 7 800 

TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG ACGTCAATGG 7 860 

GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA TATGCCAAGT 7920 

ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG. 7980 

ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 80.40 

GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC ACGGGGATTT 8100 

CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA TCAACGGGAC 8160 

TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG GCGTGTACGG 8220 

TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG AACCCACTGC TTACTGGCTT 8280 

ATCGAAATTA ATACGACTCA C TAT AGGG AG ACCCAAGCTT G 8321 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8897 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGGATCGG GAGATCTGCT AGCCCGGGTG ACCTGAGGCG CGCCGGCTTC GAATAGCCAG 60 

AGTAACCTTT TTTTTTAATT TTATTTTATT TTATTTTTGA GATGGAGTTT GGCGCCGATC 120 

TCCCGATCCC CTATGGTCGA CTCTCAGTAC AATCTGCTCT GATGCCGCAT AGTTAAGCCA 180 

GTATCTGCTC CCTGCTTGTG TGTTGGAGGT CGCTGAGTAG TGCGCGAGCA AAATTTAAGC 240 

TACAACAAGG CAAGGCTTGA CCGACAATTG CATGAAGAAT CTGCTTAGGG TTAGGCGTTT 300 

TGCGCTGCTT CGCGATGTAC GGGCCAGATA TACGCGTTGA CATTGATTAT TGACTAGTTA 360 

TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC 420 

ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC 480 

AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT 540 

GGACTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC 600 

GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 660 

CTTATGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT 720 

GATGCGGTTT TGGCAGTACA TCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC 7 80 

AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT 840 

TCCAAAATGT CGTAACAACT CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG 900 

GGAGGTCTAT ATAAGCAGAG CTCTCTGGCT AAC T AG AG AA CCCACTGCTT ACTGGCTTAT 960 

CGAAATTAAT ACGACTCACT ATAGGGAGAC CCAAGCTTGG TACCAATTTA AATTGATATC 1020 

TCCTTAGGTC TCGAGCACCA TGAAGTTGCC TGTTAGGCTG TTGGTGCTGA TGTTCTGGAT 1080 

TCCTGCTTCC AGCAGTGATG TTGTCATGAC CCAAACCCCA CTGTCCAGTC CTGTCACGCT 1140 

TGGACAACCT GCGTCCATCT CTTGCAGATC TAGTCAGATC ATTGTACATA ATAATGGCAA 12 00 

CACCTATCTG GAATGGTACC AGCAGAGACC AGGGCAGTCT CCACGGCTCC TGATCTACAA 1260 

AGTTTCCAAC CGATTTTCTG GGGTCCCAGA CAGGTTCAGC GGCAGTGGAG CTGGGACAGA 1320 

TTTCACACTC AAGATCAGCA GAGTGGAGGC TGAGGATGTG GGAGTTTACT ACTGCTTCCA 1380 

GGGTTCACAT GTTCCATTCA CGTTCGGCCA AGGGACAAAG TTGGAAATCA AACGTAAGTC ' 1440 
TCGAGTCTCT- . AGATAACCGG TCAATCGATT GGAATTCTAA ACTCTGAGGG GGTCGGATGA . 1500 

C GTGGC C AT-T-v- GT.TTGCGTAA AGC ATTGAGT - T.TACTGC AAG . GTC AGAAAAG C ATGC AAAGC 1560 

-CCTCAGAATG- 'GCTGGAAAGA GCTGGAACAA- AACAATTT-AG - AACTTTATTA -AGGAATAGGG 1-620 - 

GGAAGCTAGG" AAGAAACTCA" AAACATCAAG -"ATTTTAAATA CGCTTCTTGG TCTCCTTGCT 1680 

"ATAATH^A ^CU.- • GGGA^iAAG CTP TGCTCTTTTC^TGTCTGTCGC TAACATGCCC TTATCCGCAA ~ l"740 w 

"ACAACACAC^CAA'GGGCAGA^"AOTTTGTTAC"^IT7VAACACCA-" TCCTGTTTGC "TTCTTTCCTC ""1~80'0~ 

AGGAACTGTG GCTGCACCAT CTGTCTTCAT CTTCCCGCCA* TCTGATGAGC AGTTGAAATC . 1860 

TGGAACTGCC TCTGTTGTGT "GCCTGCTGAA TAACTTCTAT" CCCAGAGAGG CCAAAGTACA 1920 

GTGGAAGGTG- GATAACGCCC TCCAATCGGG TAACTCCGAG GAGAGTGTCA CAGAGCAGGA 1980 

GAGCAAGGAC AGCACCTACA GCCTCAGCAG CACCCTGACG CTGAGCAAAG CAGACTACGA 2 040 

GAAACACAAA GTCTACGCCT GCGAAGTCAC CCATCAGGGC CTGAGCTCGC CCGTCACAAA 2100 

GAGCTTCAAC AGGGGAGAGT GTTAGAGGGA GAAGTGCCCC CACCTGCTCC TCAGTTCCAG 2160 

CCTGACCCCC TCCCATCCTT TGGCCTCTGA CCCTTTTTCC ACAGGGGACC TACCCCTATT - 2220 

GCGGTCCTCC AGCTCATCTT TCACCTCACC CCCCTCCTCC TCCTTGGCTT TAATTATGCT 22 80 

AATGTTGGAG GAGAATGAAT AAATAAAGTG AATCTTTGCA CCTGTGGTTT CTCTCTTTCC 2340 

TCATTTAATA ATTATTATCT GTTGTTTTAC CAACTACTCA ATTTCTCTTA TAAGGGACTA 2400 

AATATGTAGT CATCCTAAGG CACGTAACCA TTTATAAAAA TCATCCTTCA TTCTATTTTA 2460 

CCCTATCATC CTCTGCAAGA CAGTCCTCCC TCAAACCCAC AAGCCTTCTG TCCTCACAGT 2520 

CCCCTGGGCC ATGGTAGGAG AGACTTGCTT CCTTGTTTTC CCCTCCTCAG CAAGCCCTCA 2580 

TAGTCCTTTT TAAGGGTGAC AGGTCTTACA GTCATATATC CTTTGATTCA ATTCCCTGAG 2640 

AATCAACCAA AGCAAATTTT TCAAAAGAAG AAACCTGCTA TAAAGAGAAT CATTCATTGC 2700 

AACATGATAT AAAATAACAA CACAATAAAA GCAATTAAAT AAACAAACAA TAGGGAAATG 2760 

TTTAAGTTCA TCATGGTACT TAGACTTAAT GGAATGTCAT GCCTTATTTA CATTTTTAAA 2820 

CAGGTACTGA GGGACTCCTG TCTGCCAAGG GCCGTATTGA GTACTTTCCA CAACCTAATT 2 880 

TAATCCACAC TATACTGTGA GATTAAAAAC ATTCATTAAA ATGTTGCAAA GGTTCTATAA 2940 

AGCTGAGAGA CAAATATATT CTATAACTCA GCAATCCCAC TTCTAGATGA CTGAGTGTCC 3000 

CCACCCACCA AAAAACTATG CAAGAATGTT C AAAGC AGC T TTATTTACAA AAGCCAAAAA 3060 

TTGGAAATAG CCCGATTGTC CAACAATAGA ATGAGTTATT AAACTGTGGT ATGTTTATAC 3120 

ATTAGAATAC CCAATGAGGA GAATTAACAA GCTACAACTA TACCTACTCA CACAGATGAA 3180 

TCTCATAAAA ATAATGTTAC ATAAGAGAAA CTCAATGCAA AAGATATGTT CTGTATGTTT 3240 

TCATCCATAT AAAGTTCAAA ACCAGGTAAA AATAAAGTTA GAAATTTGGA TGGAAATTAC 33 00 
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TCTTAGCTGG* GGGTGGGCGA GTTAGTGCCT GGGAGAAGAC AAGAAGGGGC TTCTGGGGTC 
TTGGTAATGT TCTGTTCCTC GTGTGGGGTT GTGCAGTTAT GATCTGTGCA CTGTTCTGTA 
TACACATTAT GCTTCAAAAT AACTTCACAT AAAGAACATC TTATACCCAG TTAATAGATA 
GAAGAGGAAT AAGTAATAGG TCAAGACCAA CGCAGCTGGT AAGTGGGGGC CTGGGATCAA 
ATAGCTACCT GCCTAATCCT GCCCWCTTGA GCCCTGAATG AGTCTGCCTT CCAGGGCTCA 
AGGTGCTCAA CAAAACAACA GGCCTGCTAT TTTCCTGGCA TCTGTGCCCT GTTTGGCTAG 
CTAGGAGCAC ACATACATAG AAATTAAATG AAACAGACCT TCAGCAAGGG GACAGAGGAC 
AGAATTAACC TTGCCCAGAC ACTGGAAACC CATGTATGAA CACTCACATG TTTGGGAAGG 
GGGAAGGGCA CATGTAAATG AGGACTCTTC CTCATTCTAT GGGGCACTCT GGCCCTGCCC 
CTCTCAGCTA CTCATCCATC CAACACACCT TTCTAAGTAC CTCTCTCTGC CTACACTCTG 
AAGGGGTTCA GGAGTAACTA ACACAGCATC CCTTCCCTCA AATGACTGAC AATCCCTTTG 
TCCTGCTTTG TTTTTCTTTC CAGTCAGTAC TGGGAAAGTG GGGAAGGACA GTCATGGAGA 
AAC T AC AT AA GGAAGCACCT TGCCCTTCTG CCTCTTGAGA ATGTTGATGA GTATCAAATC 
TTTCAAACTT TGGAGGTTTG AGTAGGGGTG AGACTCAGTA ATGTCCCTTC CAATGACATG 
AACTTGCTCA CTCATCCCTG GGGGCCAAAT TGAACAATCA AAGGCAGGCA TAATCC AGTT 
ATGAATTCTT GCGGCCGCTT GCTAGCTTCA CGTGTTGGAT CCAACCGCGG AAGGGCCCTA 
TTCTATAGTG TCACCTAAAT GCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT 
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 
CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 
GGCATGCTGG GGATGCGGTG GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT 
CTAGGGGGTA TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA 
CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 
CTTCCTTTCT CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA 
ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA 
GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG 
.CCGCCTCGGC- CTCTCAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT . 
TTTGCAAAAA - GCTTGGACAG ' CTCAGGGCTG CGATTTCGCG . CC AAACTTGA CGGCAATCCT - 
-AGCGTGAAGG^CTGGTAGGAT • TTTATCCGCG GTGCC ATG AT * GGTTCGACCA -TTGAACTGCA- 
TCGTCGCCGT GTCCCAAAAT" ATGGGGATTG GCAAGAACGG AGACCTACCC TGGCCTCCGC • 
TGAGGAAGGA GTTCAAGTAC ^TTCCAAAGAA TGACCACAAC GTCTTCAGTG GAAGGTAZEAC - 
AGAATCTGGT"~G A ' i"l ' A ' l 'GGGT ~AGGAAAACCT J "GGTTCTCCAT" "*TCCTGAGAAG AATCGACCTTT 
fTAAAGGACAG AATTAATATA GTTCTCAGTA GAGAACTCAA AGAACCACCA CGAGGAGCTC 
ATTTTCTTGC "CAAAAGTTTG^'GATGATGCCT' TAAGACTTAT' TGAACAACCG GAATTGGCAA' 
GTAAAGTAGA CATGGTTTGG ATAGTCGGAG GCAGTTCTGT TTACCAGGAA GCCATGAATC 
AACCAGGCCA CCTTAGACTC TTTGTGACAA GGATCATGCA GGAATTTGAA AGTGACACGT 
TTTTCCCAGA AATTGATTTG GGGAAATATA AACTTCTCCC. AGAATACCCA GGCGTCCTCT 
CTGAGGTCCA GGAGGAAAAA GGCATCAAGT ATAAGTTTGA AGTCTACGAG AAGAAAGACT 
AACAGGAAGA TGCTTTCAAG TTCTCTGCTC CCCTCCTAAA GCTATGCATT TTTATAAGAC 
CATGGGACTT TTGCTGGCTT TAGATCTCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC 
ATAATTGGAC AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG 
TGTATAATGT GTTAAACTAC TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG 
AACTGATGAA TGGGAGCAGT GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA 
AGAAATGCCA TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA 
AAAGAAGAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG 
TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA 
AGCTGCACTG CTATACAAGA AAATTATGGA AAAATATTCT GTAACCTTTA TAAGTAGGCA 
TAACAGTTAT AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC 
TATTAATAAC TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA 
TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG 
TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 
TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 
ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 
CCAAACTCAT CAATGTATCT TATCATGTCT GGATCGGCTG GATGATCCTC CAGCGCGGGG 
ATCTCATGCT GGAGTTCTTC GCCCACCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 
AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 
GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG TATACCGTCG ACCTCTAGCT 
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AGAGCTTGGC GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA 6780 

TTGCACACAA CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA 6840 

GCTAACTCAC ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT 6900 

GCCAGCTGCA TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT 6960 

CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT 7020 

CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA 7 080 

ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT 7140 

TTTTCCATAG GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 7200 

GGCGAAACCC GACAGGACTA T AAAG AT AC C AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 7260 

GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA 7320 

GCGTGGCGCT TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT 7380 

CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA 7440 

ACTATCGTCT TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 7 500 

GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 7 560 

CTAACTACGG C TAG AC TAG A AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA 7 620 

CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG 7 680 

GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT 7740 

TGATCTTTTC TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 7 800 

TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA 7860 

AATCAATCTA AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG 7920 

AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG, 7980 

TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC 8040 

GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG 8100 

AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG 8160 

AAGCTAGAGT AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG 8220 

GCATCGTGGT GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT 8280 

.. CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC „TT_CGGTCCTC . 834.0, 
-CGATCGTTGT CAGAAGTAAG " TTGGCCGCAG * TGTTATCACT .CATGGTTATG* GCAGCACTGC - ' 8400 

•'-ATAATTCTCT • TACTGTCATG— CCATCCGTAA-- GATGCTTTTC - TGTGACTGGT - GAGTAGTGAA - >8460- 

. CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 5520 

GGGATTAATAC CGCGCCACAT " AGCAGAAGTT TAAAAGTGGT^X^TCATTGGSr^AAACGCT-CT-T .:8-58fl 

~CGGGGCGAAA'7OTCTCAAGG~ATCTT^ 864-0" 

GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG VTGAGCAAAAA 87;00 

CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT7 TGAATACTCA 8760 

' TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 8820* 

ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 8880 

AAGTGCCACC TGACGTC 8897 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GAAGAGGAAG ACTGACGGTG CCCCCGCGAG TTCAGGTGCT GAGG 44 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCTCAGCACC TGAACTCGCG GGGGCACCGT CAGTCTTCCT CTTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTGGGAGGGC TTTGTTGGAG ACCGAGCACG AGTACGACTT GCCATTCAGC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 3 0 base pairs _ . 

(B) -TYPE: nucleic : acid- -l^; - . ■ "v-.. • ■„ . 

v -~ (O)'- -STRANDEDNESS :- single— — .- -r.- «. -~ - 

- (D) TOPOLOGY: " linear ~ ' 

-(alT"HOIi"ECULE TYPE .cDNA . — — — 

. ... **'."■.■"- -t- ■ , ~ v . *■ 

(xi) SEQUENCE DESCRIPTION:'; 'SEQ ID NO,: 1-7': 
GATGGTTTTC TCGATGGCGG CTGGGAGGGC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: . cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCCTCCCAG CCGCCATCGA GAAAACCATC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
GATGGTTTTC TCGATAGCGG CTGGGAGGGC TTTG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

GATGGTTTTC TCGATGGCGG CTGGGAGGGC TTTGTTGGAG ACCGAGCACG AGTACGACTT 
GCCATTCAGC CAGTCCTGGT G 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

.(A). LENGTH: 81 base pairs. 
■: ~{B)~^TY-PE: ■ nucleic?" acid' . ■-■ 

---- ~{CK STRANDEDNESS-:— sdng-le- ■ — - - 

" • (D) - TOPOLOGY: linear 

" " ( idj - MOXTECUEE'^T YREY ~c DNA* " * ~ ' ' — ' ™ 

:**'.'• - - * " 

: (xi)" SEQUENCE- "DESCRIPTION: SEQ ■ ID NO : 21 : 

CACCAGGACT GGCTGAATGG CAAGTCGTAC TCGTGCTCGG TCTCCAACAA AGCCCTCCCA 
GCCGCCATCG AGAAAACCAT C 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 8690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID -NO: 22: 



GGTACCAATT TAAATTGATA TCTCCTTAGG 
TTGGAATTCT TGCGGCCGCT TGCTAGCCAC 
TGTCCTTGTT TTAAAAGGTG TCCAGTGTGA 
AGTGCAGCCT GGAGGGTCCC TGCGACTTTC 
CTATTACATG TATTGGGTTC GCCAGGCTCC 
TAGTCAAGAT GGTGATATAA CCGACTATGC 
CAGAGACAAT GCAAAGAACA GCCTGTACCT 



TCTCGAGTCT CTAGATAACC GGTCAATCGA 
CATGGAGTTG TGGTTAAGCT TGGTCTTCCT 
AGTGCAACTG GTGGAGTCTG GGGGAGGCTT 
CTGTGCTGCA TCTGGATTCC CGTTCAGTGA 
AGGCAAGGGA CTGGAGTGGG TCTCATACAT 
AGACTCCGTA AAGGGTCGAT TCACCATCTC 
GCAAATGAAC AGCCTGAGGG ACGAGGACAC 
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AGCCGTGTAT TACTGTGCAA GAGGCCTGGC GGACGGGGCC TGGTTTGCTT ACTGGGGCCA 480 

AGGGACTCTG GTCACGGTCT CTTCCGCTAG CACCAAGGGC CCATCGGTCT TCCCCCTGGC 540 

ACCCTCCTCC AAGAGCACCT CT.GGGGGCAC AGCGGCCCTG GGCTGCCTGG TCAAGGACTA 600 

CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC 660 

CTTCCCGGCT GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TCACCGTGCC '720 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC 780 

CAAGGTGGAC AAGAAAGTTG GTGAGAGGCC AGCACAGGGA GGGAGGGTGT CTGCTGGAAG 840 

CCAGGCTCAG CGCTCCTGCC TGGACGCATC CCGGCTATGC AGCCCCAGTC CAGGGCAGCA 900 

AGGCAGGCCC CGTCTGCCTC TTCACCCGGA GGCCTCTGCC CGCCCCACTC ATGCTCAGGG 960 

AGAGGGTCTT CTGGCTTTTT CCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC 1020 

CAGGCCCTGC ACACAAAGGG GCAGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG .1080 

AGGACCCTGC CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG 1140 

GACACCTTCT CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT 1200 

CTTGTGACAA AACTCACACA TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT 1260 

CCAGCTCAAG GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAG GCCCCAGCCG 1320 

GGTGCTGACA CGTCCACCTC CATCTCTTCC TCAGCACCTG AACTCCTGGG GGGACCGTCA 1380 

GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA TCTCCCGGAC CCCTGAGGTC 1440 

ACATGCGTGG TGGTGGACGT GAGCCACGAA GACCCTGAGG TCAAGTTCAA CTGGTACGTG 1500 

GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG AGGAGCAGTA CAACAGCACG 1560 

TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT GGCTGAATGG CAAGGAGTAC 1620 

AAGTGCAAGG TCTCCAACAA AGCCCTCCCA GCCCCCATCG AGAAAACCAT CTCCAAAGCC 1680 

AAAGGTGGGA CCCGTGGGGT GCGAGGGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC 1740 

TGCCCTGAGA GTGACCGCTG TACCAACCTC TGTCCCTACA GGGCAGCCCC GAGAACCACA 1800 

GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG AACCAGGTCA GCCTGACCTG 1860 

CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG TGGGAGAGCA ATGGGCAGCC 192 0 

GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC GACGGCTCCT TCTTCCTCTA 1980 

- . CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG AACGTCTTCT CATGCTCCGT .2Q4.0_ 

^r 1 ,-- < GATGCATGAG* GCTCTGCACA ACCACTACAC GCAGAAGAGC CTCTCCCTGT CTCCGGGTAA .„ /2100- 

ATGAGTGOGA* GGGCCGGCAA GCCCCCGCTC CCCGGGGTCT CGGGGTGGCA- CGAGGAT'GCT* -- -21-60- 

TGGCACGTAC CCCCTGTACA TACTTCCCGG GCGCCCAGCA TGGAAATAAA GCACCCAGCG 222 0 

: : _rzr_ ;C~TGCCGTGGG CGGCTGCGAG -ACTGTGATGG TTCTTTCCAC GGGTCAGGCC . GAGTCTGAGG - .2280* 

~CCTGAGTGGC*~ATGAGGGAGG 'CAGAGCGGGT CCCACTGTCC^CCACACTGGC CCAGGCTGTG" ~2r3:4-0~ 

' * CAGGTGTGCC TGGGCCCCGT AGGGTGGGGC TCAGCCAGGG GCTGCCCTCG GCAGGGTGGG 2400 

— GGATTTGCCA GCGTGGCCCT ' CCCTCCAGCA GCACCTGCCC TGGGCTGGGC CACGGGAAGC' 2460'. 

CCTAGGAGCC CCTGGGGACA GACACACAGC CCCTGGCTCT GTAGGAGACT GTCCTGTTCT 2 520 

GTGAGCGCCC CTGTCCTCCC GACCTCCATG CCCACTCGGG GGCATGCCTA GTCCATGTGC 2580 

GTAGGGACAG GCCCTCCCTC ACCCATCTAC CCCCACGGCA CTAACCCCTG GCTGCCCTGC 2 640 

CCAGCCTCGC ACCCGCATGG GGACACAACC GACTCCGGGG ACATGCACTC TCGGGCCCTG 2700 

TGGAGGGACT GGTGCAGATG CCCACACACA CACTCAGCCC AGACCCGTTC AACAAACCCC 2760 

GCACTGAGGT TGGCCGGCCA CACGGCCACC ACACACACAC GTGCACGCCT CACACACGGA 2820 

GCCTCACCCG GGCGAACTGC ACAGCACCCA GACCAGAGCA AGGTCCTCGC ACACGTGAAC 2880 

ACTCCTCGGA CACAGGCCCC CACGAGCCCC ACGCGGCACC TCAAGGCCCA CGAGCCTCTC 2940 

GGCAGCTTCT CCACATGCTG ACCTGCTCAG ACAAACCCAG CCCTCCTCTC ACAAGGGTGC 3000 

CCCTGCAGCC GCCACACACA CACAGGGGAT CACACACCAC GTCACGTCCC TGGCCCTGGC 3 060 

CCACTTCCCA GTGCCGCCCT TCCCTGCAGG ACGGATCAGC CTCGACTGTG CCTTCTAGTT 312 0 

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3180 

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3240 

CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 3300 

GGCATGCTGG GGATGCGGTG GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT 3360 

CTAGGGGGTA TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA 3420 

CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 3480 

CTTCCTTTCT CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA 3540 

A ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA 3 600 

GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG 3660 

CCGCCTCGGC CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT 3720 

TTTGCAAAAA GCTTGGACAG CTCAGGGCTG CGATTTCGCG CCAAACTTGA CGGCAATCCT 3780 

AGCGTGAAGG CTGGTAGGAT TTTATCCCCG CTGCCATCAT GGTTCGACCA TTGAACTGCA 3 840 
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TCGTCGCCGT 

at* X ^w* ^w* ^k*» ^k"» 44k 


GTCCCAAAAT 


ATGGGGATTG 


GCAAGAACGG 


AGACCTACCC 


TGGCCTCCGC 


3900 


TCAGGAACGA 

pi* ^4* X ^ . -* ^4* X X^ * ^a*" * * 


GTTCAAGTAC 


TTCCAAAGAA 


TGACCACAAC 


CTCTTCAGTG 


GAAGGTAAAC 


3960 . 


AGAATCTGGT 

A- X * A Xx X -* pX ^ta^ 4441 


GATTATGGGT 


AGGAAAACCT 


GGTTCTCCAT 


TCCTGAGAAG 


AATCGACCTT 


4020 


TAAAGGACAG 

A. U U A *- -* X A J- X ^k*f 


AATTAATATA 


GTTCTCAGTA 


GAGAACTCAA 


AGAACCACCA 


C.GAGGAGCTC 


4080 


ATTTTCTTGC 

* X -X X, X- X %p»- X. X> >— ' 


CAAAAGTTTG 


GATGATGCCT 


TAAGACTTAT 


TGAACAACCG 


GAATTGGCAA 


4140 


GTAAAGTAGA 

-X X XX J X^*** aX A X X A 


CATGGTTTGG 


ATAGTCGGAG 


GCAGTTCTGT 


TTACCAGGAA 


GCCATGAATC 


4200 


AACCAGGCCA 


CCTTAGACTC 


TTTGTGACAA 


GGATCATGCA 


GGAATTTGAA 


AGTGACACGT 


4260 


TTTTCCCAGA 

«k ppW X- X- ^k** ^4l* ^ka* X 4>- ^k«* 4*- a» 


AATTGATTTG 


GGGAAATATA 


AACTTCTCCC 


AGAATACCCA 


GGCGTCCTCT 


4320 


CTGAGGTCCA 


GGAGGAAAAA 


GGCATCAAGT 


ATAAGTTTGA 


AGTCTACGAG 


AAGAAAGACT 


4380 


AACAGGAAGA 

X XX X X X^_J ^b*» X A* * X A 


TGCTTTCAAG 


TTCTCTGCTC 


CCCTCCTAAA 


GCTATGCATT 


TTTATAAGAC 


4440 


CATGGGACTT 

v - X X X> ^a*f V— J ^kpr X A ^4* aX 


TTGCTGGCTT 


TAGATCTCTT 


TGTGAAGGAA 


CCTTACTTCT 


GTGGTGTGAC 


4500 


ATAATTGGAC 

X X ^b> X U X •*<- ^p* X X ^pp 


AAACTACCTA 


CAGAGATTTA 


AAGCTCTAAG 


GTAAATATAA 


AATTTTTAAG 


4560 


TGTATAATGT 

Xr v - 1 X> X X X U X ^b- \J 4k- 


GTTAAACTAC 


TGATTCTAAT 


TGTTTGTGTA 


TTTTAGATTC 


CAACCTATGG 


4620 


AACTGATGAA 


TGGGAGCAGT 


GGTGGAATGC 


CTTTAATGAG 


GAAAACCTGT 


TTTGCTCAGA 


4680 


AGAAATGCGA 


TCTAGTGATG 

X> _ ~* ^kr X X X- X X 7 | 1* 


ATGAGGCTAC 

4k 4k 4b *Jk4* 4k 4Pi ^4* ^ka* ^a' 444 4k 4k ^pT 


TGCTGACTCT 


CAACATTCTA 


CTCCTCCAAA 


4740 


AAAGAAGAGA 

X XX XX A>i_J X XX X X X ^k*?X X 


AAGGTAGAAG 

X XX X ^b X X ^» 


ACCCCAAGGA 


CTTTCCTTCA 


GAATTGCTAA 


GTTTTTTGAG 


4800 


TCATGCTGTG 

X» ^ap* X X X aX * 


TTTAGTAATA 

^b ^b X X «^ A ^ ^k 


GAACTCTTGC 


TTGCTTTGCT 


ATTTACACCA 


CAAAGGAAAA 


4860 


AGCTGCACTG 

X X V— * X ^p#\^a*X X^a* A ^w* 


CTATACAAGA 


AAATTATGGA 


AAAATATTCT 


GTAACCTTTA 


TAAGTAGGCA 


4920 


TAACAGTTAT 

X- X XX X X X- X X X X 


AATCATAACA 

4- ^k ^k> A- -^ki -tfk. 4»4b 4* ^k 


TACTGTTTTT 


TCTTACTCCA 


CACAGGCATA 


GAGTGTCTGC 


4980 


TATTAATAAC 

Xi X X Xi Xi X XX X X X XX X Vvr 


TATGCTCAAA 

X X X ^Lh «X & XX XX X 


AATTGTGTAC 

4k 4BHB- 4k 4Bb> 4BV ^4«I 444- ^a«I 4BB- 4k 4k ^4*^ 


CTTTAGCTTT 


TTAATTTGTA 


AAGGGGTTAA 


5040 


TAAGGAATAT 

X> X XX X^aV JX XX X aX X X aX 


TTGATGTATA 

^ki ^b X X «4v ^k- K M ^kk A 


GTGCCTTGAC 


TAGAGATCAT 


AATCAGCCAT 


ACCACATTTG 


5100 


TAGAGGTTTT 

Xi X X X X >— 4X aX aX 4k 


ACTTGCTTTA 

X X ^ta^ ^b A- ^ki ^ki ^b » * 


AAAAACCTCC 


CACACCTCCC 


CCTGAACCTG 


AAACATAAAA 


5160 


TGAATGCAAT 

X- >— * X XX X X *>w* X XX X Xi 


TGTTGTTGTT 

^b ^b ^b ^b ^b ^ta^ 4^ 


AACTTGTTTA 


TTGCAGCTTA 


TAATGGTTAC 


AAATAAAGCA 


5220 


ATAGCATCAC 

X X X X X ^wX X X^ ^wX X >— * 


AAATTTCACA 


AATAAAGCAT 


TTTTTTCACT 


GCATTCTAGT 


TGTGGTTTGT 


. 5280 


CCAAACTCAT 

^ . - X XX XX X v *X ^ta-" X X aX 


CAATGTATCT 

^tair X XX X ^b fk A * ^b ^k> 


TATCATGTCT 


GGATCGGCTG 


GATGATCCTC 


CAGCGCGGGG 


5340 


ATCTCATGCT 

X X X> X> X— * X X 4X X»»» aX 


GGAGTTCTTC 


GCCCACCCCA 


ACTTGTTTAT 


TGCAGCTTAT 


AATGGTTACA 


5400 


AATAAAGCAA 

* - -.- ' w * - ■» t X XX X X X XX XX X ^a*" X XX X 


TAGCATCACA 


AATTTCACAA 


ATAAAGCATT 


TTTTTCACTG 


CATTCTAGTT 


5.46.0, 


- ' L - j.-" ' ' GTGGTTTGTC 

\^p# X> N^pf ^^pf X 4X1 ^b> •* ^L- ^k** 


CAAACTCATG 


AATGTATCTT 


-ATCATGTCTG". 


. TATACCGTCG 


ACCTCTAGCT 


5520 . 


x. . . AGAGCTTGGC 


GTAATCATGG 


-TCATAGGTGT - 


TTCCTGTGTG 


-AAATTGTTAT ■ 


CCGCTGACAA 


■ -5580 • 


•TTCCACACAA 

* X> X> X X >w X X >— ' X XX X 


CATACGAGCC 


GGAAGG ATAA " 


AGTGTAAAGC 


" CTGGGGTGCC 


TAATGAGTGA 


-.5640 . 


GGTAACTCAC 

* X X XX X X- X— ^* X X 


'ATT AATTGC G 

X X ^b ^b X XX 4*. «b. 


TTGGGCTCAC 


--TGCCCGCTTT- 


~:CCAGTCGGGA 


AACCTGTCGT 


rrs-zoo 


— B "GCCAGCTGCA" 


"TTAATGAATC 


^GGCCAACGCG"XGGGGAGAGG CGGTTTGCGT" 


-ATTGGGCGCT- 


—5-7*60 " 


CTTCtJGCTTC' 


GTCGCTCACT 


GACTCGCTGC 


GCTCGGTCGT 


TCGGCTGGGG 


C.GAGGGJGTAT. 


5820 


' CAGCTCACTC 


AAAGGCGGTA 


ATACGGTTAT 


GCACAGAATC 


AGGGGATAAC ' 


' GCAGGAAAGA 


5880 


' ■ "ACATGTGAGC 

X X W X X X Xi >_J X X Nw» 


AAAAGGCCAG 

X XX XX XX X ^^4* X X ^ * 


CAAAAGGCCA 


GGAACCGTAA 


AAAGGCCGCG 


TTGCTGGCGT 


5940 


TTTTCCATAG 

X> X> X> X Nu< X X Xr X X 


GCTCCGCCCC 


CCTGACGAGC 


ATCACAAAAA 


TCGACGCTCA 


AGTCAGAGGT 


6000 


GGCGAAACCC 

-* •* ^- . - X XX XX X 


GACAGGACTA 

^^rf 4b ^ ^k^ 4b «m. Ik* ^kar -4b ^4#* ^k -ak ^k 


TAAAGATACC 


AGGCGTTTCC 


CCCTGGAAGC 


TCCCTCGTGC 


6060 


GCTCTCCTGT 


TCCGACCCTG 

4pV ^ri' ^4*" 4k ■■■■ ^kBF" 44ki 


CCGCTTACCG 


GATACCTGTC 


CGCCTTTCTC 


CCTTCGGGAA 


6120 


GCGTGGCGCT 


TTCTCAATGC 


TCACGCTGTA 


GGTATCTCAG 


TTCGGTGTAG 


GTCGTTCGCT 


6180 


CCAAGCTGGG 


CTGTGTGCAC 


GAACCCCCCG 


TTCAGCCCGA 


CCGCTGCGCC 


TTATCCGGTA 


6240 


ACTATCGTCT 

X X ^L, X X X ^ - ^- . -* ^k ^w*- 


TGAGTCCAAC 

44k ^k4# 4k 4> ^k4T 44H ^44< ^44F ^k 44b 4t 


CCGGTAAGAC 


ACGACTTATC 


GCCACTGGCA 


GCAGCCACTG 


6300 


GTAACAGGAT 


TAGCAGAGCG 


AGGTATGTAG 


GCGGTGCTAC 


AGAGTTCTTG 


AAGTGGTGGC 


6360 


CTAACTACGG 

X XX X X- X * > * > * 


CTACACTAGA 


AGGACAGTAT 


TTGGTATCTG 


CGCTCTGCTG 


AAGCCAGTTA 


6420 


CCTTCGGAAA 

X> X X XX XX X 


AAGAGTTGGT 

4> P P> 4» ^ha" 1 4k 4k ^pP 1 44k 444- ^4P" 4Pb 


AGCTCTTGAT 


CCGGCAAACA 


AACCACCGCT 


GGTAGCGGTG 


6480 


GTTTTTTTGT 

\_J X- X. X X Xi X- Xi X» 


TTGCAAGCAG 

44b ^b *bp4" ^Ippr 4b 4V4P- H%- ^44r ^kp* 4™" 4k ^4P 


CAGATTACGC 


GCAGAAAAAA 


AGGATCTCAA 


GAAGATCCTT 


6540 


TGATCTTTTC 


TACGGGGTCT 


GACGCTCAGT 


GGAACGAAAA 


CTCACGTTAA 


GGGATTTTGG 


6600 


TCATGAGATT 

X- X X X X X > * X X ^ 


ATCAAAAAGG 


ATCTTCACCT 


AGATCCTTTT 


AAATTAAAAA 


TGAAGTTTTA 


6660 


AATCAATCTA 

X XX X ~ X XX X ^k ^ X X 


AAGTATATAT 

4b- 444k ^k *kp|l 4Bb 4b 4b -4Br 4k ak ^Ba 4t 4k 444 


GAGTAAACTT 


GGTCTGACAG 


TTACC AATGC 


TTAATCAGTG 


6720 


AGGCACCTAT 


CTCAGCGATC 


TGTCTATTTC 


GTTCATCCAT 


AGTTGCCTGA 


CTCCCCGTCG 


6780 




TACGATACGG 


GAGGGCTTAC 


CATCTGGCCC 


CAGTGCTGCA 


ATGATACCGC 


6840 


GAGACCCACG 


CTCACCGGCT 


CCAGATTTAT 


CAGCAATAAA 


CCAGCCAGCC 


GGAAGGGCCG 


6900 


AGCGCAGAAG 


TGGTCCTGCA 


ACTTTATCCG 


CCTCCATCCA 


GTCTATTAAT 


TGTTGCCGGG 


6960 


AAGCTAGAGT 


AAGTAGTTCG 


CCAGTTAATA 


GTTTGCGCAA 


CGTTGTTGCC 


ATTGCTACAG 


7020 


GCATCGTGGT 


GTCACGCTCG 


TCGTTTGGTA 


TGGCTTCATT 


CAGCTCCGGT 


TCCCAACGAT 


7080 


CAAGGCGAGT 


TACATGATCC 


CCCATGTTGT 


GCAAAAAAGC 


GGTTAGCTCC 


TTCGGTCCTC 


7140 


CGATCGTTGT 


CAGAAGTAAG 


TTGGCCGCAG 


TGTTATCACT 


CATGGTTATG 


GCAGCACTGC 


7200 


ATAATTCTCT 


TACTGTCATG 


CCATCCGTAA 


GATGCTTTTC 


TGTGACTGGT 


GAGTACTCAA 


7260 
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CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 7 320 

GGGATAATAC CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 7 380 

CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC 7 440 

GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA 7 500 

CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTCA 7 560 

TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 7 620 

ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 7 680 

AAGTGCCACC TGACGTCGAC GGATCGGGAG ATCTGCTAGG TGACCTGAGG CGCGCCGGCT 7740 

TCGAATAGCC AGAGTAACCT TTTTTTTTAA TTTTATTTTA TTTTATTTTT GAGATGGAGT 7 800 

TTGGCGCCGA TCTCCCGATC CCCTATGGTC GACTCTCAGT ACAATCTGCT CTGATGCCGC 7 860 

ATAGTTAAGC CAGTATCTGC TCCCTGCTTG TGTGTTGGAG GTCGCTGAGT AGTGCGCGAG 7920 

CAAAATTTAA GCTACAACAA GGCAAGGCTT GACCGACAAT TGCATGAAGA ATCTGCTTAG 7980 

GGTTAGGCGT TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT GACATTGATT 8040 

ATTGACTAGT TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA 8100 

GTTCCGCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 8160 

CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG 8220 

ACGTCAATGG GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA 8280 

TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC 8340 

CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC 8400 

TATTACCATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 8460 

ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA 8520 

TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG 8580 

GCGTGTACGG TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG AACCCACTGC 8640 

TTACTGGCTT ATCGAAATTA ATACGACTCA CTATAGGGAG ACCCAAGCTT 8690 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) ' SEQUENCE CHARACTERISTICS : * - . r 

- .. , {-A )*-- LENGTH : - 7*874 base- pairs • • - ~- .- ' 

: {B) TYPE: nucleic ac-icT"-" 
• • r --toy" STRANDEDNESS-:- -single—---- - ~\ *r z:r:z , ; 
" — - r — -fDj- TOPOLOGY : _ linear. ZTT * ~' — T " * " " ~ ' ' " " 

(ii) " MOLECULE TYPE: cDNA ■* ' .......... 

"■''■.*. • " ■ - I* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGTCT CTAGATAACC GGTCAATCGA 60 

TTGGAATTCT TGCGGCCGCT TGCTAGCACC AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 120 

TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT GCCTGGTCAA GGACTACTTC 180 

CCCGAACCGG TGACGGTGTC GTGGAACTCA GGCGCCCTGA CCAGCGGCGT GCACACCTTC 240 

CCGGCTGTCC TACAGTCCTC AGGACTCTAC TCCCTCAGCA GCGTGGTCAC CGTGCCCTCC 300 

AGCAGCTTGG GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG 360 

GTGGACAAGA AAGTTGGTGA GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG 420 

GCTCAGCGCT CCTGCCTGGA CGCATCCCGG CTATGCAGCC CCAGTCCAGG GCAGCAAGGC 480 

AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC CCACTCATGC TCAGGGAGAG 540 

GGTCTTCTGG CTTTTTCCCC AGGCTCTGGG CAGGCACAGG CTAGGTGCCC CTAACCCAGG 600 

CCCTGCACAC AAAGGGGCAG GTGCTGGGCT CAGACCTGCC AAGAGCCATA TCCGGGAGGA 660 

CCCTGCCCCT GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC AGCTCGGACA 720 

CCTTCTCTCC TCCCAGATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC CCAAATCTTG 7 80 

TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA GCCCAGGCCT CGCCCTCCAG 840 

CTCAAGGCGG GACAGGTGCC CTAGAGTAGC CTGCATCCAG GGACAGGCCC CAGCCGGGTG 900 

CTGACACGTC CACCTCCATC TCTTCCTCAG CACCTGAACT CCTGGGGGGA CCGTCAGTCT 960 

TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT GAGGTCACAT 1020 

GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG TACGTGGACG 1080 

GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC AGCACGTACC 1140 

GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG GAGTACAAGT 1200 
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GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC AAAGCCAAAG 12 60 

GTGGGACCCG TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC ACCCTCTGCC 1320 

CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA ACCACAGGTG 1380 

TACACCCTGC CCCCATCCCG GGATGAGCTG ACCAAGAACC AGGTCAGCCT GACCTGCCTG 1440 

GTCAAAGGCT TCTATCCCAG CGACATCGCC GTGGAGTGGG AGAGCAATGG GCAGCCGGAG 1500 

AACAACTACA AGACCACGCC TCCCGTGCTG GACTCCGACG GCTCCTTCTT CCTCTACAGC 1560 

AAGCTCACCG TGGACAAGAG CAGGTGGCAG CAGGGGAACG TCTTCTCATG CTCCGTGATG 1620 

CATGAGGCTC TGCACAACCA CTACACGCAG AAGAGCCTCT CCCTGTCTCC GGGTAAATGA 1680 

GTGCGACGGC CGGCAAGCCC CCGCTCCCCG GGCTCTCGCG GTCGCACGAG GATGCTTGGC 1740 

ACGTACCCCC TGTACATACT TCCCGGGCGC CCAGCATGGA AATAAAGCAC CCAGCGCTGC 1800 

CCTGGGCCCC TGCGAGACTG TGATGGTTCT TTCCACGGGT CAGGCCGAGT CTGAGGCCTG 1860 

AGTGGCATGA GGGAGGCAGA GCGGGTCCCA CTGTCCCCAC ACTGGCCCAG GCTGTGCAGG 192 0 

TGTGCCTGGG CCCCCTAGGG TGGGGCTCAG CCAGGGGCTG CCCTCGGCAG GGTGGGGGAT 1980 
TTGCCAGCGT GGCCCTCCCT CCAGCAGCAC CTGCCCTGGG CTGGGCCACG GGAAGCCCTA . 2 040 

GGAGCCCCTG GGGACAGACA CACAGCCCCT GCCTCTGTAG GAGACTGTCC TGTTCTGTGA 2100 

GCGCCCCTGT CCTCCCGACC TCCATGCCCA CTCGGGGGCA TGCTGGGGAT GCGGTGGGCT 2160 

CTATGGCTTC TGAGGCGGAA AGAACCAGCT GGGGCTCTAG GGGGTATCCC CACGCGCCCT 2220 

GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG 2280 

CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 2340 

GCTTTCCCCG TCAAGCTCTA AATCGGGGCA TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 2400 

GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT 2460 

GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT 2520 

TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT 2580 

TGGGGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT 2 640 

AATTCTGTGG AATGTGTGTC AGTTAGGGTG TGGAAAGTCC CCAGGCTCCC CAGGCAGGCA 27 00 

GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC CTAACTCCGC 2760 . 

CCATCCCGCC CCTAACTCCG . CCCAGTTCCG . CCCATTCTCC GCCCCATGGC TGACTAATTT 2820 . 

.TTTTTATTTA ...-TGGAGAGGCC -GAGGCCGCCT--.CGGCCTCTGA GCTATTCCAG AAGTAGTGAG . *2 880 • - : -: • 4- 

GAGGCTTTTT : TGGAGGCCTA -GGGTTTTGCA ■ AAAAGCTTGG- ACAGCTCAGG GGTGCGATTT- ■ -2940 ~. 

CGCGCC AAAC " "TTGACGGC AA TCCTAGCGTG AAGGCTGGTA GGATTTTATC CCCGCTGCCA 3000 ■ 

TCATGGTTGG^SGGAT^ CCGTGTGCCA AAATATGGGG ATTGGCAAGA™ 3&60; " f~ . 

.ACGGAGAIX^r~7^CCCTGGCCT'~CCGCTCAGGA~ACGAGTTCAA"" GTACTTCCAA AG AATGAC C A~~ ~ — 3T2 0 " ■ . 

CAACCTCTTC AGTGGAAGGT 'AAACAGAATC- TGGTGATTAT GGGTAGGAAA ACCTGGTTCT 3180 . ' .. •/.• . 

* CCATTCCTGA. GAAGAATCGA \CGTTTAAAGG' ACAGAATTAA TATAGTTCTC AGTAGAGAAC 3240 ? 

TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT GCCTTAAGAC 33 00 

TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC GGAGGCAGTT 33 60 

CTGTTTACCA GGAAGCCATG AATCAACCAG GCCACCTTAG ACTCTTTGTG ACAAGGATCA 3420 

TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA TAT AAAC TTC 3480 

TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC AAGTATAAGT 3540 

TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT GCTCCCCTCC 3600 

TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC TCTTTGTGAA 3660 

GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA TTTAAAGCTC 3720 

TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC TAATTGTTTG 37 80 

TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA ATGCCTTTAA 3840 

TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG CTACTGCTGA 3900 

CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA AGGACTTTCC 3960 

TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC TTGCTTGCTT 4020 

TGCTATTTAC ACCACAAAGG AAAAAGC TGC ACTGCTATAC AAGAAAATTA TGGAAAAATA 4080 

TTCTGTAACC TTTATAAGTA GGCATAACAG TTATAATCAT AACATACTGT TTTTTCTTAC 4140 

TCCACACAGG CATAGAGTGT CTGCTATTAA TAACTATGCT CAAAAATTGT GTACCTTTAG 42 00 

CTTTTTAATT TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT TG AC TAG AG A 42 60 

TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC CTCCCACACC 432 0 

TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG TTTATTGCAG 4380 

CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 4440 

CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT GTCTGGATCG 4500 

GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCCAACTTGT 4560 

TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 462 0 
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CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 
TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG 
TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 
AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 
CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 
GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG 
TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 
AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG . GCCAGGAACC 
GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 
TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 
TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC 
TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 
CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 
TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 
CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 
TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 
AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 
AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 
AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 
TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 
ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 
CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 
GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 
TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 
TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 
. GCAACGTTGT .TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 
- CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCGATG TTGTGCAAAA 
■ AAGCGGTTAG* CTGCTTCGGT CGTCCGATCG TTGTCAGAAG TAAGTTGGGG GCAGTGTTAT 
CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 
:iLrTTTCTGT.GAC-TGGTGAGTAG TCAACCAAGT CATTCTGAGA r ATAGTGTATG CGGCGACCGA"" 
— GTTGCTCTTG ^CCCGGCGTCA ATACGGGATA" ATACCGCGCC ' ACATAGCAGA ACTTTAAS&G" 
. ..,TGCTCATCAT TGGAAAACGT "TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 
"GATCCAGTTC GATGTAACCC AGTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 
' CCAGCGTTTC - TGGGTGAGGA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 
CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC 
AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 
GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT CGACGGATCG GGAGATCTGC 
TAGGTGACCT GAGGCGCGCC GGCTTCGAAT AGCCAGAGTA ACCTTTTTTT TTAATTTTAT 
TTTATTTTAT TTTTGAGATG GAGTTTGGCG CCGATCTCCC GATCCCCTAT GGTCGACTCT 
CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 
GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA 
CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC 
C AG AT AT AC G CGTTGACATT GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC 
ATTAGTTCAT AGCCCATATA TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC 
TGGCTGACCG CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA 
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG 
TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA 
GTACATCTAC GTATTAGTCA TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA 
TGGGCGTGGA TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT 
CTGGCTAACT AGAGAACCCA CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG 
GGAGACCCAA GCTT 



4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 
. 6240. 
. V6300 • 
-6360 

6420 

r-6^8'0: -.iTV—/ 

-6:5-4-0- 

6600. 

6660 ' ■; 

6720 • - 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7874 



(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

1 5 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Pro Phe Ser Asp Tyr 

20 25 30 

Tyr Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 

35 40 45 

Ser Tyr lie Ser Gin Asp Gly Asp lie' Thr Asp Tyr Ala Asp Ser Val 

50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr 
65 70 75 80 

Leu Gin Met Asn Ser Leu Arg Asp Glu Asp Thr Ala Val Tyr Tyr Cys 

85 90 95 

Ala Arg Gly Leu Ala Asp Gly Ala Trp Phe Ala Tyr Trp Gly Gin Gly 

100 105 110 

Thr Leu Val Thr Val Ser Ser 
115 

*. . '. (2) INFORMATION FOR SEQ I D NO : 2 5' v . rl» -:. . ■ s -■ • \ r, - 



(i) SEQUENCE CHARACTERISTICS: 
(A) .LENGTHS 3 3 0 amino' ae"ids 

" ~ t BT^TYPE : " amino" aaicT" — " ' 
..(C) STRANDEDNESS: single 
(D)' TOPOLOGY: linear' 

(ii) MOLECULE TYPE: protein 



(xi) SEQU 

Ala Ser Thr Lys 
1 

Ser Thr Ser Gly 

20 

Phe Pro Glu Pro 
35 

Gly Val His Thr 
50 

Leu Ser Ser Val 
65 

Tyr lie Cys Asn 

Lys Val Glu Pro 

100 

Pro Ala Pro Glu 
115 

Lys Pro Lys Asp 
130 



1NCE DESCRIPTION 

Gly Pro Ser Val 
5 

Gly Thr Ala Ala 

Val Thr Val Ser 

40 

Phe Pro Ala Val 
55 

Val Thr Val Pro 
70 

Val Asn His Lys 
85 

Lys Ser Cys Asp 

Leu Leu Gly Gly 

120 

Thr Leu Met lie 
135 ■ 



SEQ ID NO: 25: 

Phe Pro Leu Ala 
10 

Leu Gly Cys Leu 
25 

Trp Asn Ser Gly 

Leu Gin Ser Ser 

60 

Ser Ser Ser Leu 
75 

Pro Ser Asn Thr 
90 

Lys Thr His Thr 
105 

Pro Ser Val Phe 

Ser Arg Thr Pro 

140 



Pro Ser Ser Lys 
15 

Val Lys Asp Tyr 
30 

Ala Leu Thr Ser 
45 

Gly Leu Tyr Ser 

Gly Thr Gin Thr 

80 

Lys Val Asp Lys 
95 

Cys Pro Pro Cys 
110 

Leu Phe Pro Pro 
125 

Glu Val Thr Cys 
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Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
145 150 155 160 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 

165 170 175 

Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 

180 185 190 

His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Asp Lys Val Ser Asn 

195 200 205 

Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly 

210 215 220 

Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu 
225 230 235 240 

Leu Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 

245 250 255 

Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn 

260 265 270 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 

275 ' 280 285 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn 

290 295 300 

Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
305 310 315 320 

Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 

325 330 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i ) SEQUENCE CHARACTERISTICS : - - ■ - ; ... 

* (A)- LENGTH : ~ 2 2 0 -amino acids •• • • ••- 

- (B)~ TYPE: amino acid " . 
r ^™fC) STRANDEDNESS^=^a-hgl^t-"™: z""Z- 
— ^-XD) ' TOPOLOGY ^-linear ------- - 

(ii) MOLECULE TYPE: protein : 

. „ - ■ ■ - - ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID" NO: 2-6: 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys 

15. 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 

20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 

35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 

50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
65 . 70 75 80 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 

85 90 95 

Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 

- ' 100 105 110 

Pro Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg 

115 120 125 

Asp Glu Leu Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly 

130 135 140 

Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro 
145 150 155 160 
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Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser 

165 170 175 

Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin- Gin 

180 185 190 

Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His 

195 200 205 

Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
' 210 215 220 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 



Glu 


Val 


Asn 


Leu 


Val 


Glu 


Ser 


Gly 


Gly 


Gly 


Leu 


Val 


Gin 


Pro 


Gly 


Gly 


1 








5 










10 










15 




Ser 


Leu 


Lys 


Val 


Ser 


Cys 


Val 


Thr 


Ser 


Gly 


Phe 


Thr 


Phe 


Ser 


Asp 


Tyr 








20 










25 










30 






Tyr 


Met 


Tyr 


Trp 


Val 


Arg 


Gin 


Thr 


Pro 


Glu 


Lys 


Arg 


Leu 


Glu 


Trp 


Val 






35 










40 










45 








■Ala* 


Tyr- 


;-Iley 


-Ser- 


Gin 


Gly 


Gly Asp 


lie 


Thr 


Asp 


Tyr 


Pro 


Asp 


Thr- 


Val 




-50--' 








* *^ 












60 










Lys 


Gly Arg 


Phe 


Thr 


He 


Ser 


Arg 


Asp 


Asn 


Ala 


Lys 


Asn 


Thr 


Leu 


Tyr 


65: 










. TO' . ' 










75 










80- 


Leu 


Gln^Met*. 


Ser ""Arg" 


"Leu 


"Lys' 


' Ser' 


~Glu 


Asp 


Thr 


*Al'a~ 


'Met 


"Tyr 


-Tyr 


Cys 










85 










90 










95 




Ala 


Arg 


Gly 


Leu- Asp 


Asp" 


Gly Ala 


Trp 


Phe 


Ala 


Tyr 


Trp 


Gly 


Gin 


Gly 








100 










105 










110 






Thr 


Leu 


Val 


Thr 


Val 


Ser 


Val 


Ala 


Ser 


Thr 


Lys 


Gly 


Pro 


Ser 


Val 


Phe 






115 










120 










125 








Pro 


Leu 


Ala 


Pro 


Ser 


Ser 


Lys 


Ser 


Thr 


Ser 


Gly 


Gly 


Thr 


Ala 


Ala 


Leu 




130 










135 










140 










Gly 


Cys 


Leu 


Val 


Lys 


Asp 


Tyr 


Phe 


Pro 


Glu 


Pro 


Val 


Thr 


Val 


Ser 


Trp 


145 










150 










155 










160 


Asn 


Ser 


Gly 


Ala 


Leu 


Thr 


Ser 


Gly 


Val 


His 


Thr 


Phe 


Pro 


Ala 


Val 


Leu 










165 










170 










175 




Gin 


Ser 


Ser 


Gly 


Leu 


Tyr 


Ser 


Leu 


Ser 


Ser 


Val 


Val 


Thr 


Val 


Pro 


Ser 








180 










185 










190 






Ser 


Ser 


Leu 


Gly 


Thr 


Gin 


Thr 


Tyr 


He 


Cys 


Asn 


Val 


Asn 


His 


Lys 


Pro 






195 










200 










205 








Ser 


Asn 


Thr 


Lys 


Val 


Asp 


Lys 


Lys 


Val 


Glu 


Pro 


Lys 


Ser 


Cys 


Asp 


Lys 




210 










215 










220 










Thr 


His 


Thr 


Cys 


Pro 


Pro 


Cys 


Pro 


Gly 


Gin 


Pro 


Arg 


Glu 


Pro 


Gin 


Val 


225 










230 










235 










240 


Tyr 


Thr 


Leu 


Pro 


Pro 


Ser 


Arg 


Asp 


Glu 


Leu 


Thr 


Lys 


Asn 


Gin 


Val 


Ser 










245 










250 










255 




Leu 


Thr 


Cys 


Leu 


Val 


Lys 


Gly 


Phe 


Tyr 


Pro 


Ser 


Asp 


He 


Ala 


Val 


Glu 








260 










265 










270 






Trp 


Glu 


Ser 


Asn 


Gly 


Gin 


Pro 


Glu 


Asn 


Asn 


Tyr 


Lys 


Thr 


Thr 


Pro 


Pro 






275 










280 










285 
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Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val 

290 295 300 

Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met 
305 310 315 320 

His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser 

325 330 335 

Pro Gly Lys 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CTGCCTAGCC CTCTAGACGA TCCACTGGAC TCCGCGCGGC CGAAGCTTAT CGGTCTCATT 60 

GGAAAAAAAA ATTAAAATAA AATAAAATAA AAACTCTACC TCAAACCGCG GCTAGAGGGC 120 

TAGGGGATAC CAGCTGAGAG TCATGTTAGA CGAGACTACG GCGTATCAAT TCGGTCATAG 180 

ACGAGGGACG AACACACAAC CTCCAGCGAC TCATCACGCG CTCGTTTTAA ATTCGATGTT 240 

GTTCCGTTCC GAACTGGCTG TTAACGTACT TCTTAGACGA ATCCCAATCC GCAAAACGCG 300 

ACGAAGCGCT ACATGCCCGG TCTATATGCG CAACTGTAAC TAATAACTGA TCAATAATTA 360 

TCATTAGTTA ATGCCCCAGT AATCAAGTAT CGGGTATATA CCTCAAGGCG CAATGTATTG 420 
----- - - AATGCCATTT ACCGGGCGGA . CCGACTGGCG : GGTTGCTGGG . .GGCGGGTAAC TGCAGTTATT . , 480< , v, / - 

ACTGCATACA AGGGTATCAT TGCGGTTATC CCTGAAAGGT AACTGCAGTT ACCCAGCTGA - 540 = 

" TAAATGCCAT TTGAC.GGGTG AACCGTCATG TAGTTCACAT AGTATACGGT TCATGCGGGG 600 
-irGATAACTGCA- GTTACTGCCA TTTACCGGGC GGACCGTAAT* ACGGGTCATG TAGTGGAATA .: _ .::6-60- ' * rr: 

~^CCCTGAAAGG"ATGAACCGTC ATGTAGATGC ATAATCAGTA* GCGATAATGG TACCACTACG" Z.. "720 ~ 

- CCAAAACCGT CATGTAGTTA CCCGCACCTA TCGCCAAACT GAGTGCCCCT AAAGGTTCAG 780 . \ „. 
" AGGTGGGGTA ACTGCAGTTA CCCTCAAACA AAACCGTGGT TTTAGTTGCC CTGAAAGGTT. -840 ' 'i-r-'y. 

TTACAGCATT GTTGAGGCGG GGTAACTGCG TTTACCCGCC ATCCGCACAT GCCACCCTCC 900 

AGATATATTC GTCTCGAGAG ACCGATTGAT CTCTTGGGTG ACGAATGACC GAATAGCTTT ,960 

AATTATGCTG AGTGATATCC CTCTGGGTTC GAACCATGGT TAAATTTAAC TATAGAGGAA 1020 

TCCAGAGCTC AGAGATCTAT TGGCCAGTTA GCTAACCTTA AGAACGCCGG CGAACGATCG 1080 

GTGGTACCTC AACACCAATT CGAACCAGGA AGGAACAGGA ACAAAATTTT CCACAGGTCA 1140 

CACTTCACTT AGACCACCTC AGACCCCCTC CGAATCACGT CGGACCTCCC AGGGACTTTC 1200 

AGAGGACACA TTGGAGACCT AAGTGAAAGT CACTGATAAT GTACATAACC CAAGCGGTCT 1260 

GAGGTCTCTT CTCCGACCTC ACCCAGCGTA TGTAATCAGT TCCACCACTA TATTGGCTGA 1320 

TAGGTCTGTG ACATTTCCCA GCTAAGTGGT AGAGGTCTCT GTTACGGTTC TTGTGGGACA 1380 

TGGACGTTTA CTCGGCAGAC TTCAGACTCC TGTGTCGGTA CATAATGACA CGTTCTCCGG 1440 

ACCTGCTGCC CCGGACCAAA CGAATGACCC CGGTTCCCTG AGACCAGTGC CAGAGACATC 1500 

GATCGTGGTT CCCGGGTAGC CAGAAGGGGG ACCGTGGGAG GAGGTTCTCG TGGAGACCCC 1560 

CGTGTCGCCG GGACCCGACG GACCAGTTCC TGATGAAGGG GCTTGGCCAC TGCCACAGCA 1620 
CCTTGAGTCC GCGGGACTGG TCGCCGCACG TGTGGAAGGG CCGACAGGAT GTCAGGAGTC ' 1680 

CTGAGATGAG GGAGTCGTCG CACCAGTGGC ACGGGAGGTC GTCGAACCCG TGGGTCTGGA 1740 

TGTAGACGTT GCACTTAGTG TTCGGGTCGT TGTGGTTCCA CCTGTTCTTT CAACCACTCT 1800 

CCGGTCGTGT CCCTCCCTCC CACAGACGAC CTTCGGTCCG AGTCGCGAGG ACGGACCTGC 1860 

GTAGGGCCGA TACGTCGGGG TCAGGTCCCG TCGTTCCGTC CGGGGCAGAC GGAGAAGTGG 1920 

GCCTCCGGAG ACGGGCGGGG TGAGTACGAG TCCCTCTCCC AGAAGACCGA AAAAGGGGTC 1980 

CGAGACCCGT CCGTGTCCGA TCCACGGGGA TTGGGTCCGG GACGTGTGTT TCCCCGTCCA 2040 

CGACCCGAGT CTGGACGGTT CTCGGTATAG GCCCTCCTGG GACGGGGACT GGATTCGGGT 2100 
GGGGTTTCCG GTTTGAGAGG TGAGGGAGTC GAGCCTGTGG AAGAGAGGAG GGTCTAAGGT . 2160 

CATTGAGGGT TAGAAGAGAG ACGTCTCGGG TTTAGAACAC TGTTTTGAGT GTGTACGGGT 2220 
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GGCACGGGTC CATTCGGTCG GGTCCGGAGC GGGAGGTCGA GTTCCGCCCT GTCCACGGGA 2280 

TCTCATCGGA' CGTAGGTCCC TGTGTGGTGC ACCCATGGTT GTACAGGCCT CGGTGTACCT 2340 

GTCTCCGGCC GAGCCGGGTG GGAGACGGGA CTCTCACTGG CGACATGGTT GGAGACAGGG 2400 

ATGTCCCGTC GGGGCTCTTG GTGTCCACAT GTGGGACGGG GGTAGGGCCC TACTCGACTG 2460 

GTTCTTGGTC CAGTCGGACT GGACGGACCA GTTTCCGAAG ATAGGGTCGC TGTAGCGGCA 2520 

CCTCACCCTC TCGTTACCCG TCGGCCTCTT GTTGATGTTC TGGTGCGGAG GGCACGACCT 2580 

GAGGCTGCCG AGGAAGAAGG AGATGTCGTT CGAGTGGCAC CTGTTCTCGT CCACCGTCGT 2640 

■ CCCCTTGCAG AAGAGTACGA GGCACTACGT ACTCCGAGAC GTGTTGGTGA TGTGCGTCTT 27 00 

CTCGGAGAGG GACAGAGGCC CATTTACTCA CGCTGCCGGC CGTTCGGGGG CGAGGGGCCC 2760 

GAGAGCGCCA GCGTGCTCCT ACGAACCGTG CATGGGGGAC ATGTATGAAG GGCCCGCGGG 2820 

TCGTACCTTT ATTTCGTGGG TCGCGACGGG ACCCGGGGAC GCTCTGACAC TACCAAGAAA 2880 

GGTGCCCAGT CCGGCTCAGA CTCCGGACTC ACCGTACTCC CTCCGTCTCG CCCAGGGTGA 2940 

CAGGGGTGTG ACCGGGTCCG ACACGTCCAC ACGGACCCGG GGGATCCCAC CCCGAGTCGG 3000 

TCCCCGACGG GAGCCGTCCC ACCCCCTAAA CGGTCGCACC GGGAGGGAGG TCGTCGTGGA 3060 

CGGGACCCGA CCCGGTGCCC TTCGGGATCC TCGGGGACCC CTGTCTGTGT GTCGGGGACG 3120 

GAGACATCCT CTGACAGGAC AAGACACTCG CGGGGACAGG AGGGCTGGAG GTACGGGTGA 3180 

GCCCCCGTAC GGATCAGGTA CACGCATCCC TGTCCGGGAG GGAGTGGGTA GATGGGGGTG 3240 

CCGTGATTGG GGACCGACGG GACGGGTCGG AGCGTGGGCG TACCCCTGTG TTGGCTGAGG 3300 

CCCCTGTACG TGAGAGCCCG GGACACCTCC CTGACCACGT CTACGGGTGT GTGTGTGAGT 3360 

CGGGTCTGGG CAAGTTGTTT GGGGCGTGAC TCCAACCGGC CGGTGTGCCG GTGGTGTGTG 3420 

TGTGCACGTG CGGAGTGTGT GCCTCGGAGT GGGCCCGCTT GACGTGTCGT GGGTCTGGTC 3480 

TCGTTCCAGG AGCGTGTGCA CTTGTGAGGA GCCTGTGTCC GGGGGTGCTC GGGGTGCGCC 3540 

GTGGAGTTCC GGGTGCTCGG AGAGCCGTCG AAGAGGTGTA CGACTGGACG AGTCTGTTTG 3600 

GGTCGGGAGG AGAGTGTTCC CACGGGGACG TCGGCGGTGT GTGTGTGTCC CCTAGTGTGT 3660 

GGTGCAGTGC AGGGACCGGG ACCGGGTGAA GGGTCACGGC GGGAAGGGAC GTCCTGCCTA 372 0 

GTCGGAGCTG ACACGGAAGA TCAACGGTCG GTAGACAACA AACGGGGAGG GGGCACGGAA 3780 

. GGAACTGGGA CCTTCCACGG TGAGGGTGAC AGGAAAGGAT TATTTTACTC CTTTAACGTA . 3.840 

• • ■" - - ' : - GCGTAAC AG A CTCATCCACA ' GTAAGATAAG ACCCCCCAGC: CCACCCCGTC CTGTCGT-TCC 3900 

• ■ • CCCTCCTAAC CCTTCTGTTA TCGTGCGTAC- GACeCGTAGG' GCACCCGAGA TACCGAAGAC 3960 
■- TCCGCCTTTC TTGGTCGACC CCGAGATCCC CCATAGGGGT GCGCGGGACA TCGCCGCGTA . 4020 
ATTCGCGCCG CCCACACCAC CAATGGGCGI^ CGCACTGGCGTATGTGAACGG: -TCGGGGGATC r "\;4080 

" '"~GCGGGCGAGG T KKAGCGAfiAG" AAGGGAAGGA" AAGAGCGGTG_r AAGCGGCCCGGAGAGTTTT' "4140' 

- :TT.CCCTTTTT . TTCGTACGTA GAGTTAATCA GTCGTTGGTA ' TCAGGGCGGG GATTGAG.GCG . 42 00. 

"GGTAGGGCGG GGATTGAGGC GGGTC AAGGC GGGTAAGAGG "CGGGGTACCG 'AC TGATT AAA 4260" 

AAAAATAAAT ACGTCTCCGG' CTCCGGCGGA GCCGGAGACT CGATAAGGTC TTCATCACTC .4320 

CTCCGAAAAA ACCTCCGGAT CCGAAAACGT TTTTCGAACC TGTCGAGTCC CGACGCTAAA 4380 

GCGCGGTTTG AACTGCCGTT AGGATCGCAC TTCCGACCAT GCTAAAATAG GGGCGACGGT 4440 

AGTACCAAGC TGGTAACTTG ACGTAGCAGC GGCACAGGGT TTTATACCCC TAACCGTTCT 4500 

TGCCTCTGGA TGGGACCGGA GGCGAGTCCT TGCTCAAGTT CATGAAGGTT TCTTACTGGT 4560 

GTTGGAGAAG TCACCTTCCA TTTGTCTTAG ACCACTAATA CCCATCCTTT TGGACCAAGA 4620 

GGTAAGGACT CTTCTTAGCT GGAAATTTCC TGTCTTAATT ATATCAAGAG TCATCTCTTG 4680 

AGTTTCTTGG TGGTGCTCCT CGAGTAAAAG AACGGTTTTC AAACCTACTA CGGAATTCTG 4740 

AATAACTTGT TGGCCTTAAC CGTTCATTTC ATCTGTACCA AACCTATCAG CCTCCGTCAA 4800 

GACAAATGGT CCTTCGGTAC TTAGTTGGTC CGGTGGAATC TGAGAAACAC TGTTCCTAGT 4860 

ACGTCCTTAA ACTTTCACTG TGCAAAAAGG GTCTTTAACT AAACCCCTTT ATATTTGAAG 4920 

AGGGTCTTAT GGGTGCGCAG. GAGAGACTCC AGGTCCTCCT TTTTCCGTAG TTCATATTCA 4980 

AACTTCAGAT GCTCTTCTTT CTGATTGTCC TTCTACGAAA GTTCAAGAGA CGAGGGGAGG 5040 

ATTTCGATAC GTAAAAATAT TCTGGTACCC TGAAAACGAC CGAAATCTAG AGAAACACTT 5100 

CCTTGGAATG AAGACACCAC ACTGTATTAA CCTGTTTGAT GGATGTCTCT AAATTTCGAG 5160 

ATTCCATTTA TATTTTAAAA ATTCACATAT TACACAATTT GATGACTAAG ATTAACAAAC 5220 

ACATAAAATC TAAGGTTGGA TACCTTGACT ACTTACCCTC GTCACCACCT TACGGAAATT 5280 

ACTCCTTTTG GACAAAACGA GTCTTCTTTA CGGTAGATCA . CTACTACTCC GATGACGACT 5340 

GAGAGTTGTA AGATGAGGAG GTTTTTTCTT CTCTTTCCAT CTTCTGGGGT TCCTGAAAGG 5400 

AAGTCTTAAC GATTCAAAAA ACTCAGTACG ACACAAATCA TTATCTTGAG AACGAACGAA 5460 

ACGATAAATG TGGTGTTTCC TTTTTCGACG TGACGATATG TTCTTTTAAT ACCTTTTTAT 5520 

AAGACATTGG AAATATTCAT CCGTATTGTC AATATTAGTA TTGTATGACA AAAAAGAATG 5580 

AGGTGTGTCC GTATCTCACA GACGATAATT ATTGATACGA GTTTTTAACA CATGGAAATC 5640 
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GAAAAATTAA ACATTTCCCC AATTATTCCT TATAAACTAC ATATCACGGA ACTGATCTCT 5700 

AGTATTAGTC GGTATGGTGT AAACATCTCC AAAATGAACG AAATTTTTTG GAGGGTGTGG 57 60 

AGGGGGACTT GGACTTTGTA TTTTACTTAC GTTAACAACA ACAATTGAAC AAATAACGTC 582 0 

GAATATTACC AATGTTTATT TCGTTATCGT AGTGTTTAAA GTGTTTATTT CGTAAAAAAA 5880 

GTGACGTAAG ATCAACACCA AACAGGTTTG AGTAGTTACA TAGAATAGTA CAGACCTAGC 5940 

CGACCTACTA GGAGGTCGCG CCCCTAGAGT ACGACCTCAA GAAGCGGGTG GGGTTGAACA 6000 

AATAACGTCG AATATTACCA ATGTTTATTT CGTTATCGTA GTGTTTAAAG TGTTTATTTC 6060 

GTAAAAAAAG TGACGTAAGA TCAACACCAA ACAGGTTTGA GTAGTTACAT AGAATAGTAC 6120 

AGACATATGG CAGCTGGAGA TCGATCTCGA ACCGCATTAG TACCAGTATC GACAAAGGAC 6180 

ACACTTTAAC AATAGGCGAG TGTTAAGGTG TGTTGTATGC TCGGCCTTCG TATTTCACAT 6240 

TTCGGACCCC ACGGATTACT CACTCGATTG AGTGTAATTA ACGCAACGCG AGTGACGGGC 63 00 

GAAAGGTCAG CCCTTTGGAC AGCACGGTCG ACGTAATTAC TTAGCCGGTT GCGCGCCCCT 63 60 

CTCCGCCAAA CGCATAACCC GCGAGAAGGC GAAGGAGCGA GTGACTGAGC GACGCGAGCC 642 0 

AGCAAGCCGA CGCCGCTCGC CATAGTCGAG TGAGTTTCCG CCATTATGCC AATAGGTGTC 6480 

TTAGTCCCCT ATTGCGTCCT TTCTTGTACA CTCGTTTTCC GGTCGTTTTC CGGTCCTTGG 6540 

CATTTTTCCG GCGCAACGAC CGCAAAAAGG TATCCGAGGC GGGGGGACTG CTCGTAGTGT 6600 

TTTTAGCTGC GAGTTCAGTC TCCACCGCTT TGGGCTGTCC TGATATTTCT ATGGTCCGCA 6660 

AAGGGGGACC TTCGAGGGAG CACGCGAGAG GACAAGGCTG GGACGGCGAA TGGCCTATGG 6720 

ACAGGCGGAA AGAGGGAAGC CCTTCGCACC GCGAAAGAGT TACGAGTGCG ACATCCATAG 67 80 

AGTCAAGCCA CATCCAGCAA GCGAGGTTCG ACCCGACACA CGTGCTTGGG GGGCAAGTCG 6840 

GGCTGGCGAC GCGGAATAGG CCATTGATAG CAGAACTCAG GTTGGGCCAT TCTGTGCTGA 6900 

ATAGCGGTGA CCGTCGTCGG TGACCATTGT CCTAATCGTC TCGCTCCATA CATCCGCCAC 6960 

GATGTCTCAA GAACTTCACC ACCGGATTGA TGCCGATGTG ATCTTCCTGT CATAAACCAT 7 020 

AGACGCGAGA CGACTTCGGT CAATGGAAGC CTTTTTCTCA ACCATCGAGA ACTAGGCCGT 7 080 

TTGTTTGGTG GCGACCATCG CCACCAAAAA AACAAACGTT CGTCGTCTAA TGCGCGTCTT 7140 

TTTTTCCTAG AGTTCTTCTA GGAAACTAGA AAAGATGCCC CAGACTGCGA GTCACCTTGC 7200 

TTTTGAGTGC AATTCCCTAA AACCAGTACT CTAATAGTTT. TTCCTAGAAG TGGATCTAGG 7260 

AAAATTTAAT -TTTTACTTCA AAATTTAGTT -AGATTTCATA - TATACTCATT TGAACCAGAC 7320 
TGTCAATGGT- ~T AC G AATT AG ' TGACTCCGTG -GATAGAGTGG- GTAGACAGAT AAAGCAAGTA - 73 8 0 - 

GGTATCAACG "GACTGAGGGG CAGCACATCT ATTGATGCTA TGCCCTCCCG AATGGTAGAC 7440 

CGGGGTGA^G' A€^TTACTAT=G5CGGTCTCG^TGGGAGT.GG, CCGAGGTCTA AATAGTCGTT' 7 500 

;ATTTGGTCGa~TCGGCCTTCC^ 7 560 

AGGTCAGATA ATTAACAACG GCCCTTCGAT GTCATTCATC. . AAGCGGTCAA. TTATCAAACG 7 620 
CGTTGCAACA ACGGTAACGA TGTCGGTAGG "ACCACAGTGC GAGCAGCAAA CCATACCGAA * 7 680 

GTAAGTCGAG GCCAAGGGTT GCTAGTTCCG CTCAATGTAC ' TAGGGGGTAC AACACGTTTT 7740 

TTCGCCAATC GAGGAAGCCA GGAGGCTAGC AACAGTCTTC ATTCAACCGG CGTCACAATA 7800 

GTGAGTACCA ATACCGTCGT GACGTATTAA GAGAATGACA GTACGGTAGG CATTCTACGA 7860 

AAAGACACTG ACCACTCATG AGTTGGTTCA GTAAGACTCT TATCACATAC GCCGCTGGCT 792 0 

CAACGAGAAC GGGCCGCAGT TATGCCCTAT TATGGCGCGG TGTATCGTCT TGAAATTTTC 7980 

AC GAG TAG TA ACCTTTTGCA AGAAGCCCCG CTTTTGAGAG TTCCTAGAAT GGCGACAACT 8040 

CTAGGTCAAG CTACATTGGG TGAGCACGTG GGTTGACTAG AAGTCGTAGA AAATGAAAGT 8100 

GGTCGCAAAG ACCCACTCGT TTTTGTCCTT CCGTTTTACG GCGTTTTTTC CCTTATTCCC 8160 

GCTGTGCCTT TACAACTTAT GAGTATGAGA AGGAAAAAGT TATAATAACT TCGTAAATAG 822 0 

TCCCAATAAC AGAGTACTCG CCTATGTATA AACTTACATA AATCTTTTTA TTTGTTTATC 8280 

CCCAAGGCGC GTGTAAAGGG GCTTTTCACG GTGGACTGCA G 8321 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
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CCATGGTTAA ATTTAACTAT AGAGGAATCC AGAGCTCAGA GATCTATTGG CCAGTTAGCT 60 

AACCTTAAGA ACGCCGGCGA ACGATCGTGG TTCCCGGGTA GCCAGAAGGG GGACCGTGGG 120 

AGGAGGTTCT CGTGGAGACC CCCGTGTCGC CGGGACCCGA CGGACCAGTT CCTGATGAAG 180 

GGGCTTGGCC ACTGCCACAG CACCTTGAGT CCGCGGGACT GGTCGCCGCA CGTGTGGAAG 240 

GGCCGACAGG ATGTCAGGAG TCCTGAGATG AGGGAGTCGT CGCACCAGTG GCACGGGAGG 300 

TCGTCGAACC CGTGGGTCTG GATGTAGACG TTGCACTTAG TGTTCGGGTC GTTGTGGTTC 360 

CACCTGTTCT TTCAACCACT CTCCGGTCGT GTCCCTCCCT CCCACAGACG ACCTTCGGTC 420 

CGAGTCGCGA GGACGGACCT GCGTAGGGCC GATACGTCGG GGTCAGGTCC CGTCGTTCCG 480 

TCCGGGGCAG ACGGAGAAGT GGGCCTCCGG AGACGGGCGG GGTGAGTACG AGTCCCTCTC 540 

CCAGAAGACC GAAAAAGGGG TCCGAGACCC GTCCGTGTCC GATCCACGGG GATTGGGTCC 600 

GGGACGTGTG TTTCCCCGTC CACGACCCGA GTCTGGACGG TTCTCGGTAT AGGCCCTCCT 660 

GGGACGGGGA CTGGATTCGG GTGGGGTTTC CGGTTTGAGA GGTGAGGGAG TCGAGCCTGT 720 

GGAAGAGAGG AGGGTCTAAG GTCATTGAGG GTTAGAAGAG AGACGTCTCG GGTTTAGAAC 780 

ACTGTTTTGA GTGTGTACGG GTGGCACGGG TCCATTCGGT CGGGTCCGGA GCGGGAGGTC 840 

GAGTTCCGCC CTGTCCACGG GATCTCATCG GACGTAGGTC CCTGTCCGGG GTCGGCCCAC 900 

GACTGTGGAG GTGGAGGTAG AGAAGGAGTC GTGGACTTGA GGACCCCCCT GGCAGTCAGA 960 

AGGAGAAGGG GGGTTTTGGG TTCCTGTGGG AGTACTAGAG GGCCTGGGGA CTCCAGTGTA 1020 

CGCACCACCA CCTGCACTCG GTGCTTCTGG GACTCCAGTT CAAGTTGACC ATGCACCTGC 1080 

CGCACCTCCA CGTATTACGG TTCTGTTTCG GCGCCCTCCT CGTCATGTTG TCGTGCATGG 1140 

CACACCAGTC GCAGGAGTGG CAGGACGTGG TCCTGACCGA CTTACCGTTC CTCATGTTCA 1200 

CGTTCCAGAG GTTGTTTCGG GAGGGTCGGG GGTAGCTCTT TTGGTAGAGG TTTCGGTTTC 12 60 

CACCCTGGGC ACCCCACGCT CCCGGTGTAC CTGTCTCCGG CCGAGCCGGG TGGGAGACGG 1320 

GACTCTCACT GGCGACATGG TTGGAGACAG GGATGTCCCG TCGGGGCTCT TGGTGTCCAC 13 80 

ATGTGGGACG GGGGTAGGGC CCTACTCGAC TGGTTCTTGG TCCAGTCGGA CTGGACGGAC 1440 

CAGTTTCCGA AGATAGGGTC GCTGTAGCGG CACCTCACCC TCTCGTTACC CGTCGGCCTC 1500 

. TTGTTGAT.GT TCTGG.TGCGG. AGGGCACGAC CTGAGGCTGC CGAGGAAGAA GGAGATGTCG 1560 

. TTCGAGTGGG-^ACGTGTTCTC-GTCCACCGTC^ GTCCCCTTGC . AGAAGAGTAC GAGGCACTAC 1620 
- GTACTCCGAG -ACGTGTTGGT -GATGTGGGTG TTCTGGGAGA GGGACAGAGG GGCATTTACT- — 1680 

CACGCTGGCG'GCCGTTCGGG.'GGCGAGGGGC CCGAGAGCGC" -CAGCGTGCTC CTACGAACCG 1740 

' TG ^A4WGG(g-^A TG0aTGA^gGGGCCGGGG GGTCGTACCT TTATTTCGTG GGTCGCGAGG* :rl800 
GG ACCCGGGU-. AC ' GL ' TL ' T GAC~ ACT AC CAAGA " AAGGTGCCCA~~ GTCCGGCTCA "GACTCCGGAC" — I860 

TCACCGTACT -CCCTCCGTCT/J CGCCCAGGGT GACAGGGGTG „TGACCGGGTC CGACACGT.CC 1920 

ACACGGACCC: GGGGGATCCC ACGCCGAGTC- GGTCCGCGAC GGGAGCCGTC CCACCCCCTA 1980 

AACGGTCGCA CCGGGAGGGA GGTCGTCGTG GACGGGACCC GACCCGGTGC CCTTCGGGAT 2 040 

CCTCGGGGAC CCCTGTCTGT GTGTCGGGGA CGGAGACATC CTCTGACAGG ACAAGACACT 2100 

CGCGGGGACA GGAGGGCTGG AGGTACGGGT GAGCCCCCGT ACGACCCCTA CGCCACCCGA 2160 

GATACCGAAG ACTCCGCCTT TCTTGGTCGA CCCCGAGATC CCCCATAGGG GTGCGCGGGA 2220 

CATCGCCGCG TAATTCGCGC CGCCCACACC ACCAATGCGC GTCGCACTGG CGATGTGAAC 22 80 

GGTCGCGGGA TCGCGGGCGA GGAAAGCGAA AGAAGGGAAG GAAAGAGCGG TGCAAGCGGC 2340 

CGAAAGGGGC AGTTCGAGAT TTAGCCCCGT AGGGAAATCC CAAGGCTAAA TCACGAAATG 2400 

CCGTGGAGCT GGGGTTTTTT GAACTAATCC CACTACCAAG TGCATCACCC GGTAGCGGGA 2460 

CTATCTGCCA AAAAGCGGGA AACTGCAACC TCAGGTGCAA GAAATTATCA CCTGAGAACA 2520 

AGGTTTGACC TTGTTGTGAG TTGGGATAGA GCCAGATAAG AAAACTAAAT ATTCCCTAAA 2580 

ACCCCTAAAG CCGGATAACC AATTTTTTAC TCGACTAAAT TGTTTTTAAA TTGCGCTTAA 2640 

TTAAGACACC TTACACACAG TCAATCCCAC ACCTTTCAGG ' GGTCCGAGGG GTCCGTCCGT 27 00 

CTTCATACGT TTCGTACGTA GAGTTAATCA GTCGTTGGTA TCAGGGCGGG GATTGAGGCG 27 60 

GGTAGGGCGG GGATTGAGGC GGGTCAAGGC GGGTAAGAGG CGGGGTACCG AC TG ATT AAA 2820 

AAAAATAAAT ACGTCTCCGG CTCCGGCGGA GCCGGAGACT CGATAAGGTC TTCATCACTC 2880 

CTCCGAAAAA ACCTCCGGAT CCGAAAACGT TTTTCGAACC TGTCGAGTCC CGACGCTAAA 2940 

GCGCGGTTTG AACTGCCGTT AGGATCGCAC TTCCGACCAT CCTAAAATAG GGGCGACGGT 3000 

AGTACCAAGC TGGTAACTTG ACGTAGCAGC GGCACAGGGT TTTATACCCC TAACCGTTCT 3060 

TGCCTCTGGA TGGGACCGGA GGCGAGTCCT TGCTCAAGTT CATGAAGGTT TCTTACTGGT 3120 

GTTGGAGAAG TCACCTTCCA TTTGTCTTAG ACCACTAATA CCCATCCTTT TGGACCAAGA 3180 

GGTAAGGACT CTTCTTAGCT GGAAATTTCC TGTCTTAATT ATATCAAGAG TCATCTCTTG 3240 

AGTTTCTTGG TGGTGCTCCT CGAGTAAAAG AACGGTTTTC AAACCTACTA CGGAATTCTG 33 00 

AATAACTTGT TGGCCTTAAC CGTTCATTTC ATCTGTACCA AACCTATCAG CCTCCGTCAA 33 60 
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GACAAATGGT CCTTCGGTAC TTAGTTGGTC CGGTGGAATC TGAGAAACAC TGTTGCTAGT 3420 

ACGTCCTTAA ACTTTCACTG TGCAAAAAGG GTCTTTAACT AAACCCCTTT ATATTTGAAG 3480 

AGGGTCTTAT GGGTCCGCAG GAGAGACTCC AGGTCCTCCT TTTTCCGTAG TTCATATTCA 3540 

AACTTCAGAT GCTCTTCTTT CTGATTGTCC TTCTACGAAA GTTCAAGAGA CGAGGGGAGG 3600 

ATTTCGATAC GTAAAAATAT TCTGGTACCC TGAAAACGAC CGAAATCTAG AGAAACACTT 3660 

CCTTGGAATG AAGACACCAC ACTGTATTAA CCTGTTTGAT GGATGTCTCT AAATTTCGAG 3720 

ATTCCATTTA TATTTTAAAA ATTCACATAT TACACAATTT GATGACTAAG ATTAACAAAC 3780 

ACATAAAATC TAAGGTTGGA TACCTTGACT ACTTACCCTC GTCACCACCT TACGGAAATT 3840 

ACTCCTTTTG GACAAAACGA GTCTTCTTTA CGGTAGATCA CTACTACTCC GATGACGACT 3900 

GAGAGTTGTA AGATGAGGAG GTTTTTTCTT CTCTTTCCAT CTTCTGGGGT TCCTGAAAGG 3960 

AAGTCTTAAC GATTCAAAAA ACTCAGTACG ACACAAATCA TTATCTTGAG AACGAACGAA 402 0 

ACGATAAATG TGGTGTTTCC TTTTTCGACG TGACGATATG TTCTTTTAAT ACCTTTTTAT 4080 

AAGACATTGG AAATATTCAT CCGTATTGTC AATATTAGTA TTGTATGACA AAAAAGAATG 4140 

AGGTGTGTCC GTATCTCACA GACGATAATT ATTGATACGA GTTTTTAACA CATGGAAATC 4200 

GAAAAATTAA ACATTTCCCC AATTATTCCT T AT AAAC T AC ATATCACGGA ACTGATCTCT 4260 

AGTATTAGTC GGTATGGTGT AAACATCTCC AAAATGAACG AAATTTTTTG GAGGGTGTGG 4320 

AGGGGGACTT GGACTTTGTA TTTTACTTAC GTTAACAACA ACAATTGAAC AAATAACGTC 4380 

GAATATTACC AATGTTTATT TCGTTATCGT AGTGTTTAAA GTGTTTATTT CGTAAAAAAA 4440 

GTGACGTAAG ATCAACACCA AACAGGTTTG AGTAGTTACA TAGAATAGTA CAGACCTAGC 4500 

CGACCTACTA GGAGGTCGCG CCCCTAGAGT ACGACCTCAA GAAGCGGGTG GGGTTGAACA 4560 

AATAACGTCG AATATTACCA ATGTTTATTT CGTTATCGTA GTGTTTAAAG TGTTTATTTC 4620 

GTAAAAAAAG TGACGTAAGA TCAACACCAA ACAGGTTTGA GTAGTTACAT AGAATAGTAC 4680 

AGACATATGG CAGCTGGAGA TCGATCTCGA ACCGCATTAG TACCAGTATC GACAAAGGAC 4740 

ACACTTTAAC AATAGGCGAG TGTTAAGGTG TGTTGTATGC TCGGCCTTCG TATTTCACAT 4800 

TTCGGACCCC ACGGATTACT CACTCGATTG AGTGTAATTA ACGCAACGCG AGTGACGGGC 4860 

GAAAGGTCAG CCCTTTGGAC AGCACGGTCG ACGTAATTAC TTAGCCGGTT GCGCGCCCCT 4920 
- . . CTCCGCCAAA -GGCATAACCC. GCGAGAAGGC GAAGGAGCGA GTGACTGAGC GACGCGAGCC _ . 4980 . _..„..„■ 
* *. " AGCAAGCCGA GGCCGCTCGC: CATAGTCGAG TGAGTTTGCG CCATTATGCC AATAGGTGT.C:- 5 0 4 0 

• -TTAGTGGGCT- ATTGCGTGCT- -TTCTTGTAGA 1 CTGGTTT-TCC- GGTGGTTTTC • CGGTCGTTGG' -5100 - : 

" CATTT-TTCCG . "GCGC AACGAC CGCAAAAAGG TATCCGAGGC --GGGGGGACTG CTCGTAGTGT 5160. " 

: - ,a T'FTTAGCTGG^^AGTTCAGTC TCCACCGCTT TGGGCTGTCC --^TGATAI^TTCT" ATGGTCCGCKrr-z^5-22:0^ ™ 

~— : : — * A7COTGGG AC C" TTCG AGGGAG"~CAC GC G AGXG : "GACAAGGCTG'*~GGACGGCGAA TGGCCT2£T9Gzr7"5-2-8*0 s T 

ACA'GGCGGAA AGAGGGAAGC CCTTCGCACC GCGAAAGAGT TACGAGTGCG ACATCCATAG ; 5340 . 

AGTGAAGCCA CATCCAGCAA GCGAGGTTCG ACCCGACACA CGTGCTTGGG GGGCAAGTCG " ''5400' "v.v- 

GTTGGGCCAT TCTGTGCTGA 5460 

TCGCTCCATA CATCCGCCAC 552 0 

ATCTTCCTGT CATAAACCAT 5580 

ACCATCGAGA ACTAGGCCGT 5640 

CGTCGTCTAA TGCGCGTCTT 5700 

CAGACTGCGA GTCACCTTGC 5760 

TTCCTAGAAG TGGATCTAGG 582 0 

TATACTCATT TGAACCAGAC 5880 

CTAGACAGAT AAAGCAAGTA 5940 

TGCCCTCCCG AATGGTAGAC 6000 

CCGAGGTCTA AATAGTCGTT 6060 

ACGTTGAAAT AGGCGGAGGT 6120 

AAGCGGTCAA TTATCAAACG 6180 

GAGCAGCAAA CCATACCGAA 6240 

TAGGGGGTAC AACACGTTTT 6300 

ATTCAACCGG CGTCACAATA 6360 

GTACGGTAGG CATTCTACGA 6420 

TATCACATAC GCCGCTGGCT 6480 

TGTATCGTCT TGAAATTTTC 6540 

TTCCTAGAAT GGCGACAACT 6600 

AAGTCGTAGA AAATGAAAGT 6660 

GCGTTTTTTC CCTTATTCCC 6720 

TATAATAACT TCGTAAATAG 6780 



GGCTGGCGAC "GCGGAATAGG 
ATAGCGGTGA CCGTCGTCGG 
GATGTCTCAA GAACTTCACC 
AGACGCGAGA CGACTTCGGT 
TTGTTTGGTG GCGACCATCG 
TTTTTCCTAG AGTTCTTCTA 
TTTTGAGTGC AATTCCCTAA 
AAAATTTAAT TTTTACTTCA 
TGTCAATGGT TACGAATTAG 
GGTATCAACG GACTGAGGGG 
CGGGGTCACG ACGTTACTAT 
ATTTGGTCGG TCGGCCTTCC 
AGGTCAGATA ATTAACAACG 
CGTTGCAACA ACGGTAACGA 
GTAAGTCGAG GCCAAGGGTT 
TTCGCCAATC GAGGAAGCCA 
GTGAGTACCA ATACCG.TCGT 
AAAGACACTG ACCACTCATG 
CAACGAGAAC GGGCCGCAGT 
ACGAGTAGTA ACCTTTTGCA 
CTAGGTCAAG CTACATTGGG 
GGTCGCAAAG ACCCACTCGT 
GCTGTGCCTT TACAACTTAT 



CCATTGATAG CAGAACTCAG 
TGACCATTGT CCTAATCGTC 
ACCGGATTGA TGCCGATGTG 
CAATGGAAGC CTTTTTCTCA 
CCACCAAAAA AACAAACGTT 
GGAAACTAGA AAAGATGCCC 
AACCAGTACT CTAATAGTTT 
AAATTTAGTT AGATTTCATA 
TCACTCCGTG GATAGAGTCG 
CAGCACATCT ATTGATGCTA 
GGCGCTCTGG GTGCGAGTGG 
CGGCTCGCGT CTTCACCAGG 
GCCCTTCGAT CTCATTCATC 
TGTCCGTAGC ACCACAGTGC 
GCTAGTTCCG CTCAATGTAC 
GGAGGCTAGC AACAGTCTTC 
GACGTATTAA GAGAATGACA 
AGTTGGTTCA GTAAGACTCT 
TATGCCCTAT TATGGCGCGG 
AGAAGCCCCG CTTTT.GAGAG 
TGAGCACGTG GGTTGACTAG 
TTTTGTCCTT CCGTTTTACG 
GAGTATGAGA AGGAAAAAGT 
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TCCCAATAAC AGAGTACTCG CCTATGTATA AACTTACATA AATCTTTTTA TTTGTTTATC 
CCCAAGGCGC GTGTAAAGGG GCTTTTCACG GTGGACTGCA GCTGCCTAGC CCTCTAGACG 
ATCCACTGGA CTCCGCGCGG CCGAAGCTTA TCGGTCTCAT TGGAAAAAAA AATTAAAATA 
AAATAAAATA AAAACTCTAC CTCAAACCGC GGCTAGAGGG CTAGGGGATA CCAGCTGAGA 
GTCATGTTAG ACGAGACTAC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
CCTCCAGCGA CTCATCACGC GCTCGTTTTA AATTCGATGT TGTTCCGTTC CGAACTGGCT 
GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC GACGAAGCGC TACATGCCCG 
GTCTATATGC GCAACTGTAA CTAATAACTG ATCAATAATT ATCATTAGTT AATGCCCCAG 
TAATCAAGTA TCGGGTATAT ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG 
ACCGACTGGC GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 
TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGCCA TTTGACGGGT 
GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG GGATAACTGC AGTTACTGCC 
ATTTACCGGG GGGACCGTAA TACGGGTCAT GTACTGGAAT ACCCTGAAAG GATGAACCGT 
CATGTAGATG CATAATCAGT AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT 
ACCCGCACCT ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT TGTTGAGGCG 
GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC CAGATATATT CGTCTCGAGA 
GACCGATTGA TCTCTTGGGT GACGAATGAC CGAATAGCTT TAATTATGCT GAGTGATATC 
CCTCTGGGTT CGAA 
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